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!)  ABSTRACT 

When  in  a  two-way  classification  model  the  block  affects  take  the  form 
v.  +  .  x  where  v.  and  $ .  are  unknown  constants,  j  —  1,  7,  ...,  r  where  r  is  the 

ninber^of  different  blocks,  and  X  is  a  known  covariate  (measured  without  error), 
a  test  for  block  differences  will  be  a  function  of  X.  Clearly,  if  X  =  0  a  test^ 
for  block  differences  would  test  differs nces  in  the  v . .  And  if  i X  j  is  infinitely 
large,  block  differences  would  be  caused  only  by  differences  in  the  <+> . 

The  model  in  question  might  be  written  as  ~  u  +  +  v  +  4*^  +  +  eij 

■  The  u ,  0,'s,  v.’s,  'I1 ,  and  ^'s  (j  *  1,  2,  ...»  r;  i  *  1»  2,  t)  are  unknown 

constants  and  constraints  are  placed  on  the  6's,  v.'s,  and  x.  .  is  the  value 

t*  h  ^  ^  ^  ^ 

of  the  covariate  X  corresponding  to  the  ii  outcome,  y^.  x_  is  assumed  to  be 

measured  without  error.  The  c.  ,’s  are  realizations  of  random  variables  assumed  to 
be  distributed  normally  and  independently  with  mean  0  and  variance  o  . 

Utilizing  this  normality  assumption  and  standard  least  squares  techniques,  a 
statistic  can  be  constructed  to  test  block  effects  which  will  have  an  exact  x  (or 
F  when  o2  is  unknown)  distribution.  This  statistic  is  a  function  of  X  and  is  termed 
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Efforts  to  solve  Ty(x)  «  C  for  X,  i.e.,  efforts  to  find  the  ."  values  for  which 
the  hypothesis  would  not  be  rejected  at  any  particular  level  of  teut  significance, 
reveal  that  T  (x)  places  different  weights  on  different  contrasts  of  the  blocks. 

m  an  effort  to  weight  all  contrasts  of  the  blocks  equally,  a  second  statistic, 
S  (x) ,  is  developed  which  has  the  same  mean  and  variance  as  a  Multiple  of  a  x  stat¬ 
istic  with  a  certain  number,  m(x),  degrees  of  freedom.  That  is 


S  (x) 

Y 


ra(x)  'm(x) 


The  statistic  SY(x)  appears  to  have  higher  power  against  general  alternatives 
of  interest  than  docs  TY(x).  The  equation  SY(x)  -  C  is  more  easily  solved  for  the 
values  of  X  such  that  SY(x)  *  c,  always  yielding  a  set  of  x’s  which  ir  a  single  inter 
val  or  the  complement  of  a  single  interval  on  the  real  line.  The  fact  that 

y2  is  a  function  of  X  and  not  a  constant  causes  little  practical  difficulty 
m(x)  Am(x) 

since  v2  is  a  close  approximation  of  r  ,  ie  many  cases. 

m(x)  m(x) 

Hence,  3  (x)  is  recommended  as  a  tost  statistic  for  dotermininq  whether  or  not 
blocks  are  significantly  different  at  any  particular  value  of  the  covariate  X. 
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CHAPTER  I 


INTRODUCTION 

y 

The  one  and  i.wo-way  classification  models  with  a  covariate  have  been 
treated  from  many  points  of  view.  However  the  question  of  the  covariate 
regression  parameter  changing  from  block  to  block  has  not  been  treated  so 
extensively.  That  such  changes  might  occur  is  certainly  logical.  Granting 
that  "yield"  might  increase  linearly  with,  say,  tenperature,  it  would  not 
be  at  all  surprising  to  find  certain  blocks,  perhaps  materials,  to  be  more 
responsive  to  temperature  than  other  blocks. 

In  the  analysis  of  variance  a  single  regression  effect  costs"  a 
single  degree  of  freedom.  If  the  regression  effects  change  from  block  to 
block  it  will  be  necessary  to  estimate,  or  adjust  for,  as  many  regression 
effects  as  there  are  blocks,  say  r.  Thus  the  additional  "cost"  of  block 
regression  effect  differences  is  r-1  degrees  of  freedom. 

As  a  tool  in  evaluating  the  adequacy  of  a  one-way  classification  mode] 
with  a  covariate,  Robson  and  Atkinson  [7]  propose  an  individual  degree  of 
freedom  test  for  the  homogeniety  of  regression  coefficients.  The  same  thing 
can  be  done  -n  the  two-way  model.  And  in  a  two  way  model  with  individual 
regression  coefficients  in  the  blocks  a  single  degree  of  freedom  test  can 
be  constructed  to  test  for  regression  coefficient  differences  among  the 
treatments.  An  easy  way  to  construct  these  tests  is  to  mimic  Scheffe'a 
motivation  (flj  of  Tukey's  singls  degree  of  freedom  for  non-additivity  19J. 

If  the  model  under  consideration  is  -  u  ♦  '>{  +  vj  *  *ij  +  rij  *nd 


l 


one  is  concerned  that  the  coefficient  \p  might  be  changing  from  block  to 

block,  the  procedure  is  to  insert  a  non-linear  r^rm  of  the  sort  <j>v.x.  .  into 

3  30 

the  model.  Then  this  expanded  model  is  approximated  by  the  linear  model 

A  A 

=  y  +  0^  +  Vj  +  ij/x^  +  4') jXi j  +  e„  where  is  the  standard  least 
squares  estimate  of  under  the  original  model.  It  is  then  the  sum  of 
squares  for  the  dummy  coefficient  <J>  adjusted  for  y,  9,  v,  and  ip  which  is 
used  to  test  the  adequacy  of  the  original  model.  The  fact  that  the  test 
is  neither  exact  nor  powerful  does  not  keep  it  from  being  very  useful  in 
the  field  of  model  building. 

Granting  that  one  has  adopted  a  model  with  individual  regression  co¬ 
efficients  for  the  blocks  and  fixed  effects  for  the  treatments,  one  then 
finds  the  literature  preoccupied  with  eliminating  the  effects  of  blocks 
and  covariates  in  order  to  focus  attention  on  the  treatments.  Much  of  this 
work  has  been  done  by  C.  p.  Cox  (1J.  Noteworthy  also  is  Zelen's  adaptation 
of  this  problem  to  incomplete  block  designs  [10]. 

Traditionally  a  covariate  has  been  considered  to  be  a  nuisance  factor 
to  be  eliminated.  Perhaps  the  covariate  has  been  the  victim  of  an  unfriendly 
pres3.  If  the  covariate  takes  on  a  limited  number  of  discrete  values  as  in 
Cox's  first,  second,  and  third  units  of  tifce  [1]  the  covariate  model  is 
actually  a  shortcut  to  a  factorial  analysis  in  which  all  effects  of  the 
covariate  factor  other  than  linear  effects  are  confounded.  These  effects 
may  be  of  great  interest.  Even  when  the  covariate  is  uncontrollable,  it  is 
quite  possible  that  it  is  known.  Of  course  if  all  experimental  units  show 
the  same  response  to  the  covariate,  that  is  if  y^  ■  y  +  6^^  + 
then  if  high  yields  can  be  considered  to  be  good,  it  takes  no  great  mathemat¬ 
ical  calculations  to  say  "the  higher  the  X  the  better"  or  "the  lower  the  X 
the  better,"  and  likewise,  "the  higher  (or  lower)  the  the  better."  But 
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when  the  regression  coefficient  depends  on  the  block,  the  value  of  the 
covariate  can  become  quite  important. 

Consider  three  blocks  with  fixed  effects  (taken  to  sum  to  0)  of  -1, 

0,  and  +1.  If  their  corresponding  block  regression  effects  (also  taken  to 

sum  to  0)  are  +1,  0,  and  -1  (that  is,  b^  =  -1  +  X,  b^  =  0  +  OX  =  0, 

b3  =  1  -  X) ,  then  the  question  "Which  block  gives  the  highest  yield?"  can 

be  answered  only  if  we  know  the  value  of  X.  Clearly,  if  the  covariate  is 

less  than  1,  the  third  block  will  be  preferred  and  if  the  covariate  is  more 

than  1,  the  first  block  will  be  preferable.  Ey  taking  the  block  regression 

effects  to  sum  to  0  the  possibility  of  a  large  general  regression  effect 

with  the  covariate  has  not  been  eliminated.  That  is,  in  the  model 

y.  .  *  u  +  6.  +  v.+  ifrx.  .  +  i.x.  .  +  e.  .  the  value  of  1J1  may  be  very  much 
13  i  3  13  3  13  13 

larger  than  the  values  of  the  Then  we  would  say  "the  higher  the  X 

the  better — but  if  X  happens  to  be  less  than  1,  try  to  use  block  three  and 
if  X  happens  to  be  greater  than  1,  try  to  use  block  one."  It  is  not  diffi¬ 
cult  to  see  the  userulness  of  this  sort  of  i  formation  which  is  based  here 
on  some  insight  into  the  "true"  parameters  of  the  model.  In  our  real  world, 
of  course,  we  will  have  to  estimate  these  parameters  and  we  shall  l>e  interested 
not  only  in  which  estimated  block  effect  is  larger  at  any  X,  but  in  whether 
these  block  effects  are  significantly  different  at  any  particular  value  of 
X  in  light  of  the  normality  assumption  made  for  the  e^'s. 

Our  interest  in  a  significance  level  arises  from  a.i  assumption  of  a 
lest  function  associated  with  the  choice  of  block.  If  blocks  represent  the 
type  of  material  it  is  logical  to  assume  that  some  types  are  more  expensive 
than  others.  Even  if  they  are  all  of  the  same  price,  keeping  the  materials 
separated  might  be  expensive  and  should  not  be  undertaken  unless  we  are 
reasonably  sure  that  the  blocks  are  really  different  at  some,  if  not  all. 
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values  of  the  covariate  X  (perhaps  the  temperature  or  humidity) .  This 
thesis  does  not  direct  itself  to  any  specific  loss  function;  hence ,  it  is 
left  to  the  practitioner  to  bridge  the  gap  between  the  loss  function  in¬ 
herent  in  the  application  and  the  significance  level  of  the  t'.*st. 

Returning  to  the  previous  example,  consider  the  block  and  regression 
effects  given  above  to  be  estimates  rather  than  known  parameters.  That  is, 

*  A  A 

let  b1  =  -1  +  X,  b2  =  0,  and  b3  *  1  -  X  . 


Now  the  question  to  be  answered  is  "for  what  values  of  x  are  blocks 
significantly  different?"  Surely  the  point  X  =  +1  will  not  be  one  cf  these 
points,  for  at  that  point  the  block  estimates  are  exactly  equal.  We  shall 
expect,  intuitively,  that  the  answer  to  the  above  question  will  be  the  zeal 
line  with  an  interval  about  the  point  X  *  1  deleted  And  if  that  interval 
included  the  whole  real  line  the  answer  would  be  that  the  blocks  are  nowhere 
significantly  different.  Thu  higher  the  significance  level,  implying  that 
there  is  a  greater  penalty  for  wrongly  reporting  block  differences,  the 
larger  we  shall  expect  that  interval  to  be. 

The  fact  tli at  the  most  conventionally  constructed  test  for  block  differ¬ 
ences  is  very  capable  of  answering  the  above  question,  not  with  one  interval, 
but  with  two  dlajoint  intervals  (by  saying  that  blocks  are  different  every¬ 
where  along  the  real  line  except  within  those  two  disjoint  intervals)  is  per* 
haps  the  most  interesting  aspect  of  this  thesis. 
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Knowing  the  values  for  X  at  which  blocks  are  significantly  different 
enables  one  to  know  when  it  is  worthwhile  to  be  particular  about  blocks. 

The  estimates  of  the  block  effects  tell  which  block  to  favor  at  any  given 
value  of  X.  No  consideration  is  given  in  t  is  work  to  the  problem  of  which 
blocks  cause  the  difference  when  difference  results. 

A  test  is  sought,  thon,  for  a  null  hypothesis  of  "no  difference 
between  blocks  when  X  <  xQ"  under  the  assumption  of  the  model 


y .  .  «  y  +  G.  +  ibx,  .  +  <p.x..  +  e.. 
ril  i  ij  J  ij  ij 


i  =  1,  2,  •••  ,  t;  j  =  1,  2,  •••  ,  r ;  e.  .  'V  NID(0,  a  ) 

J*  J 


The  ii,  9^,  v.i  i>,  and  are  unknown  constants,  y  represents  a  fixed  mean 
effect,  0^  represents  a  fixed  treatment  effect,  a  fixed  block  effect, 
iji  a  mean  regression  coefficient,  and  a  block  regression  coefficient. 

In  the  manner  of  other  two-way  models  these  effects  are  not  completely 
estimable  and  we  will  need  to  place  three  conditions  on  thorn.  The  usual 

condition  £  9^  =  C  is  made  to  make  the  estimates  of  the  0^  unique.  But 

i 

two  more  conditions  must  be  taken  to  discern  the  effects  from  the 

fixed  mean  effect  y  and  the  regression  mean  effect  4»X  As  usual  we  have 
a  great  deal  of  choice  regarding  these  conditions.  The  criterion  is  generally 
mathematical  neatness ,  but  in  this  problem  the  conditions  which  result  ih 
mathematical  neatness  conplicate  the  parametric  statement  of  the  tuU  hypo¬ 
thesis.  The  conditions  we  shall  use  are  j[  (x^  -  *  0  and 

l  (v ^  '  ,xt  .  4-  ^  .)  »  0  where  x^  -  £  J  .  Note  that  when  x^  * 

for  all  j  and  j’  the  first  of  these  conditions  simplifies  to  \  -  0  and 

j  3 

consequently  the  second  simplifies  to  J  (v  »  vx#  .)  *  0  .  By  defining 

j  3  3 

<c.  -  v.  +  Cx  .  the  second  condition  further  simplifies*  to  £  * ,  •  0  . 

j  j  ‘3  j  5 


Vli is  soecia.1  case,  x  .  ~  x  for  all  j  and  j’,  shall  be  b resented 

•  *  “1  '  ”  1 

fu.Uv f  not.  with  the  thought  that  this  case  will  occur  frequently  in  practice , 

but  rather  to  illuminate  various  theoretical  aspects  of  the  problem.  After 

full  consideration  of  this  special  case  it  will  be  shown  that  the  general 

case  fol  lows  by  appropriate  introduction  of.  matrix  multipliers. 

It  should  not  cause  concern  that  in  the  general  case  the  conditions 

used  to  solve  the  normal  ecr cations  depend  on  ..._  .  The  null  hypothesis  to 

0 

bo  tested  also  depends  on  ,  j  ,  Vrt  .  +  x.« ,  ®  0  "'or  all  •;  vc  -bus 

0  -  a  0 '  i 

H-.:  v  +  xn<p .  0  . 

a  j  oi 

.  When  a  statistic  is  . found  to  test  the  hypothesis  a*-  X  -  x  that  test 
will  bo  inverted  to  yield  the  set  of  X's  for  which  the  test  would  be  rejected 
This  test  inversion  will  not  yield  a  -onfldence  set  !  “cause  the  inversion 
does  not  extend  to  all  alternate  hypotheses.  For  example,  tf  finds  prob¬ 
able  values  of  X  if  v,  +  Xf} .  =0  but  not  if  v.  t  \C .  -  I  .  A  confidence 

3  1  3  1 

set  is  of  value  when  a  parameter  is  unknown,  lr.  this  problem  the  covariate, 

X,  is  krewn  and  wo  are  interested  instead  in  which  X’s  will  cause  differences 


in  ch«  r- locks.  Hence,  if  x  is  to  the  set,  then  we  have  reason  to  believe 

P 

that  assignment  of  block  will  affect,  yield  wnen  X  =  x 

? 


CHAPTER  I 


Th'E  SPECIAL  CASE  OP  EQUAL 
COVARIATE  BLOCK  MEANS 

The  least  squares  estimates  of  the  block  parameters  are  sought  as 
a  basis  for  a  test  of  their  difference.  The  experimental  model  under  con-* 
sideration  can  be  written 


yij 


V  + 


VV  V/ij 


+  *rjaij +  eu 


i  -  1,  2,  •••  ,  t;  j  -  1,  2,  •••  ,  ri  l  -  0!  I  et  -  0j  l  Sj  -  0 


where  x„  is  the  value  of  the  covariate  X  in  its  original  units  and  assuming, 
for  this  chapter,  the  condition 


x  .  *  x  for  all  j,  j 


In  referring  to  chapter  I, 

* 

of  this  reparameterisation  is  .0  make 

of  factoring  (x.  ,  -  x  .)  into  r,  and  a. . 

ij  1  3  ‘3 

of  variation  after  a  test  is  derived. 


+  and  Sj  *  •  The  purpose 

and  6^  independent.  The  purpose 
is  only  to  aid  in  analysing  sources 
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8 


Let 


A  ’  ‘‘ij’txr  '  *-l*  V  -3'  '  ?r>  • 


That  is,  the  vector  a^  has  elements  ai;.  .  Then  let 


-2 


a 

-r 


rtxr 


Let 


rxr 


diag  (r.)  Brxl 


01 

■  - 

K1 

\ 

*2 

• 

*2 

e 

en 

• 

e 

if  « 

e 

-txl  “ 

e 

• 

-rxl 

e 

e 

8 

K 

0 

_  r 

_  r 

.  * 

H  m  rt]i  +  H  e  . 
ij  3  ij  ] 


l  Vij  -  ry  +  r9i  +  I  +  *  I  +  I 


I  y 


il 


ty  +  t <j  +  I 


rJ  [ 


rj  \  Vi  *  rj6j  *  rJ*  *  rJ  1  Vu 

Note  that  the  conditions  ^8^0  and  l  m  0  were  used  in  forming  the 
above  relationships,  but  the  third  condition  £  6^  ■  0  has  not  yet  been  used. 
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Let  6r  0,  <,  ip,  i.Ad  y  be  the  unbiased  estimates  of  the  parameters  of 
the  model  retaining  the  conditions  restricting  the  parameters.  That  is 

AAA 

1*6  =  0,  1*0  =  0,  IV  =  0.  Then  the  normal  equations  become 

£  ■  wi 

~dt  8  r )%  =  yl  +  0  +  ■g-  ARg  +  ■—  ARl 

^  A  A 

“(1^  0  Ir)%  -  pi  +  < 

A  A  A  A  A  A  A  A  A 

(1)  RA^jr  =  RA'6  +  R  §  +  R‘ >1  =  RA'6  +  R  [§  +  i|il] 

Solvings 

(2)  RA 1  [-(I  0  1')-  ~  J]V  -  RA'0  +  -  RA'AR[0  +  ij/l]  . 

r  t  -r  rt  ~  r  ~  - 

Subtracting  (2)  from  (1) 

RA^y  -  RA'  [i(l^.  »Ir)*^  Jty  -  [R2  -  ~  RA' AR]  (6  *  ipi)  . 

2  1 

Assuming  the  matrix  [R  -  —  RA'AR]  to  be  nonsingular* 

§  +  il  -  [ R~ 1  ( I  -  j  A'AJ'V1]  (RA^  -  RA’i(It  0  r)Y^ 

and 

(I  -  i  J)  [0  +  ill  «  §  •  (I  -  i  J)[R’1(I  -  i  A'A)“V1](RA^[Irt  -  ^(It  0  Jr)])  . 

This  is  the  first  time  the  condition  j>  6^  ■  0  or  £  0^  *  0  has  been  utilized* 

and  it  is  used  here  to  eliminate 


i  '  l^i't  8  V-  -k  W* 


•See  Appendix. 
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Note  that  RA*  [t  .  -  -(I.  0  J  )]A.R  *»  R[I  -  -  A'AJR  .  We  assume 
a  rt  r  t  r  a  r 

o 

e..  'v  NID(0,  a”).  Hence, 


g  ^  N[$,  a2 (I  -  7  J) R_1(I  -  -  A’A)"1r"1(I  -  i  J)]  . 


Likewise , 


K  *  N [< ,  c2  ~(i  -  ^  j)] 


We  shall  be  interested  in  block  effects  at  x^,  so  we  seek  the  distri¬ 
bution  of  k.  +  B .  (xrt  -  y.  .5,  and  since  x  ,  =  x  ..  ,  for  all  j  and  j',  the 
j  j  0  * j  *j*j 

vector  of  block  effects  at  is  k  +  (xQ  -  x, ^)B  .  Letting  z^  **  x^  "  x.  j» 
we  seek  the  distribution  of  k  +  z^S  • 

A  A 

If  the  independence  of  k  and  6  is  not  apparent  from  the  manner  in 
which  they  have  been  constructed,  it  can  be  easily  verified  by  matrix 
multiplication. 


*d!irt  -  X  •  “V 1  X 8  y-  £  w  -  ° 


Hence, 


k  +  zQe  a.  wj<  +  z06,  o2(I  +  Z0R_1(I  "  r  a,a>”1r_1J(i  “  ~  J>) 

The  matrix  (I  -  —  J)R  *(I  -  —  A'A)  *R  ^(1  -  —  J;  will,  of  course, 
r  r  r 

figure  heavily  in  the  solution  of  any  problem  involving  blubk  differences. 
Some  properties  of  this  matrix  are  explored  in  the  appendix. 


CHAPTER  III 


AN  EXACT  TEST  FOR  THE  HYPOTHESIS 
IN  THE  SPECIAL  CASE 

In  terms  of  the  parameters  of  Chapter  I  where  »  \i  +  0^^  r  + 
i|>x^  +  $jx£j  +  e£j'  a  statistic  is  desired  to  test  the  hypothesis  that 
Vj  +  «  0  for  all  j  .  In  terms  of  the  parameters  of  Chapter  II#  this 

hypothesis  becomes  +  (xQ  -  xt  j)(3j  =  0. 

In  the  special  case  of  equal  covariate  block  means  where  x^  -  x^  ** 
xQ  -  x#j,  *  zQ  #  the  null  hypothesis  of  no  difference  between  blocks  can 
be  stressed  parametrically  as 

H0:  k  +  zQ§  =  0 
vs.  Ha!  K  +  ZQS  ¥  0 

From  the  work  of  Chapter  II  we  see  that 

<  +  zQl  *  N[0,  o2(I  -  ^  J)(£  I  +  ZqM)  (I  "  ~  J)  1 
under  under  HQ  where  M  -  r"’1(I  -  ~  A'A)"1R~1  .  Thus  under  the  null  hypothesis 
K  +  ZQg  -V.  N  [0,  02(I  ”  J)  ||  I  +  *Jm)  (I  “  “  j)  ]  . 

To  test  the  null  hypothesis,  then,  we  seek  a  statistic  with  a  well 
known  (i.e.,  tabled)  distribution  under  the  null  hypothesis  and  with 
expectation  under  the  alternate  hypothesis  dependent  on  (tc  +  zQ§) '  (k  +  zQ§) 
or  on  (<  ♦  +  rQ§)  where  A  is  a  positive  definite  matrix. 
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~  w*'v'V  « w'(1  "  7  '  7  J)w  ^  X2  i  -I 

a  a  '  r-l,”(y '  V~y) 

o 

This  statistic  is  independent  cf  the  mean  square  error  so  a  true 

F-statistic  can  be  formed  which  will  eliminate  the  necessity  to  know  the 
2 

error  variance  o  or  to  estimate  it  from  other  information.  The  independence 
follows  from  the  fact  that  the  sum  of  squares  for  error  is  constructed 
orthogonal  to  those  for  <  and  those  for  §  .  Hence,  MSE  is  independent  of 

A  % 

an.  function  of  the  £  and  g  vectors. 

Having  produced  a  statistic  which  will  test  the  hypothesis,  we  would 
be  satisfied  if  we  were  interested  in  only  one  value  of  X(or  Z) .  But, 
of  course,  we  wish  to  produce  a  set  of  x's  for  whi  h  the  null  hypothesis 
can  be  rejected.  Zt  is  in  this  regard  that  the  statistic  T(zQ)  becomes 
unwieldy. 


«H|  H 
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2 

Note  that  if  C  is  a  critical  point  for  x  statistic  with  r-1  degrees 

2 

of  freedom,  then  we  would  be  interested  in  solving  the  equation  T(z)  ■  o  C  ^ 
for  the  z  points  at  which  T(z)  equals  the  critical  point.  With  those  values 
of  Z  we  could  deduce  the  set  of  points  for  which  T(z)  >  u  j  from  the 
continuity  of  T(z) . 

Decompose  (I  -  —  J)M(I  -  ~  J)  into  eigenroots  and  vectors  by  letting 


r-1 


u!l  =  0 

-i- 


(1  -  i  J)M(I  -  i  J)  -  l  HiV  -  1  i  -  i' 

i=l 

HjHi.  -  o  i  +  i' 


Note  that  there  is  one  zero  eigenroot  which  corresponds  to  the  vector 
-±-1  . 

Then  T (z)  can  be  expanded  in  terms  of  these  came  eigenvectors. 


T(z)  -  (k 


-  +  z&) '|t  1  +  *  I  ^i-i-i)  +  z-) 

.  /r-1  ,  ,  ,  ,  \-l  - 

§) I  <±  +  z"Yi)Hiy'  +  ±  ”  11*1  <k  +  zB) 


(k  +  zB) 


“  (k  +  zB)  '(  \ 


[T-r^U 

\i-l  *  Y.  +±l 


(k  +  zB)  since  (x  +  zB)  1  (11')  ('■:  +  zB)  *  0 


rr1  +  +  as2§,yiki§ 

"  l  2~  1 

i-i 


Each  of  the  r-1  terms  is  a  ratio  of  two  quadratic  functions  in  t. 

Hence,  if  r»2  there  would  be  only  one  term  and  we  could  solve  the  equation 
2 

T(z)  «o  getting  at  most  two  solutions  zcl  and  r.^,  zcl  <  z^  .  He 

would  need  to  check  the  magnitude  of  T(z)  and  any  one  point;  probably  T(0), 
Without  loss  of  generality  assume  that  T(0)  <  .  Then  if  0  e  (*c^»  *c2) 
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:-jp  would  I:';';..  V  b.'.OCX.!S  4 


'■?  s L gr, i f i ears i.:  t •■■■  d.irterenr  er'iy  ours  Ida  rk -j 

i •,  •-  }  ocVn  are 


■  (:t  ,  )  .  Or.  if  0  /  i?  , ,  a  _)  w<?  won  1.4  state 

cl  c<!  cl  c2 

significantly -different  only  inside  the  interval  (t,  , 

This  cnee  pas-/  solution  is  avai  l able  in  the  event  that  ail  the 
eiqenroctn  arc  identical,  that  i s ,  v ,  *  y  ~  >  for  all  i  and  i 1  .  Than 

2.  j 


k  *  ic  +  2  zk  '  +  z  B  '  i; 


')  - 

"ft  1 , 


2 

2  Y 


L 

t 


And  again  equation  T { r.)  to  a  critical  va.l-.io,  . 
moot  two  values  fot  Again  by  continuity  we  know 
significantly  different  only  when  z  is  between  the-.-. 

that  interval. 


"C  , ,  will  yield  at 
that  block::,  will  be 
■  two  value-?  or  outside 


These  special  cases, 


,  for  all  i  and  i‘,  can  bo  considered 


geometrical lv  bv  examining  the  en nation 


1 1  •  +  2  •  3 


2  j. 

•  - 


r''i?.nt  notice 

T  (,0‘:  -  tK ’c  T  o  for  a  L  t  :-. 

T(«)  **  B’r 

The  extreme  points  of  the  function  T(r.)  are  obtained  bv 


I:--.",  4-  •*  Irtish  4  2"  ’ 


■  -  \ 

t-  - -  '’y. 

..  .  \  »  * 


c,:b.  4  Jr 


'  *  I  -  -OV  1 
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Then 

A  A  A  A 

2*  A  §'§  E'S 

0  »  z  <  '  |y  -  z  — — -  -  2y<  '<  -  — — 

where  z  is  a  maximum  or  minimun  of  the  function  T(z) . 


and 

z  »=0  if  k' 3  =  0  . 

C  *“  *“ 

Hence,  if  ic'3  ¥  0,  T(z)  has  two  extreme  points  z  ,  <  0  <  z  _  and  the 

-  —  Cl  C  4. 

A  A 

single  extreme  point  0  if  <'3  ■  0  . 

If  r=2,  one  o£  the  extreme  points  yield  a  minimum  of  T(z)  =  0  .  If 
r  >  2  it  is  doubtful  that  T(z)  will  actually  attain  0  . 

A  A 

Then,  if  <'3  >  0,  T(z)  has  two  real  extreme  points  and  we  can  sketch 
the  illustrative  curves  of  Figure  a. 


s 


Figure  a 
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In  all  of  these  figures  a  critical  point  for  the  statistic  at  any 
2 

zQ  is  the  point  (z^,  a  .  Thus  a  critical  curve  for  the  function  T(z) 

2 

would  be  a  straight  line  o  units  above  the  origin.  Any  horizontal 
line  which  intersects  any  of  the  curves  intersects  it  twice  giving  rise  to 
end  points  of  a  critical  interval  (z  ,  z  ) .  [The  only  exception  to  this 

i’i 

generality  would  be  a  line  units  above  the  origin  in  Figures  a.  and  b. 

which  would  produce  a  critical  interval  of  the  form  (-«,  z  )  or  (s  ,+*»).] 

c  c 

2 

Let  be  the  set  of  z's  such  that  T(z)  >  o  C  ^  .  That  is,  if 

Z0  e  '  block  effects  are  significantly  different  when  Z  =»  zQ  .  Let  U 

be  the  totality  of  points  on  the  real  line. 

Then  when  y^  =  y^,  for  all  i,  i',  the  set  ftT  =  (z^,  Zc2)  or 

flj,  *  TT  -  (acl,  z^) .  The  syittoolism  U  -  (z^,  z^)  is  used  as  shorthand 

notation  for  (-°°,  z^)  U  (z^,  +00)  /  the  union  of  two  open-ended  intervals. 

No  distinction  will  be  made  here  between  open  and  closed  intervals  which 

2 

shall  be  justified  only  by  saying  that  T(z)  »  o  cr-1  only  on  a  set  of  z's 
of  measure  zero. 

The  above  reference  to  infinite  values  of  the  covariate  occur  only 
for  completeness  in  consideration  of  the  function  T(z).  As  in  any  regression 
type  problem,  we  are  not  justified  in  extrapolating  our  results  beyond  the 
range  of  the  covariate  actually  covered  in  the  experiment:.  We  are  even  more 
restricted  in  this  particular  model,  and  should  not  place  much  weight  on 
covariate  values  beyond  the  range  covered  in  each  and  every  block.  Hence, 
our  interest  in  T(«)  is  as  an  unattainable  limit  point  rather  than  an 
asymptote . 

Thus  far  we  have  considered  only  the  situation  when  all  the  eigenroots 
are  identical.  When  the  roots  are  different  the  situation  is  much  more 
cooplex.  Let  r-3  so  that  (I  -  ~  J)M(I  -  —  J) ,  the  covariance  matrix  of  6, 


18 


h a:t  two  roots  Y^  and  y2  such  that  ?  Y2  •  Then 


T(z) 


(k'Pj)2  +  2z(K,Hl)(g,1i1)  +  z2fg*Ml)2 

__ 

V  +t 

(«'M2)2  +  2z(k ’jj^)  <3  * ^2 ^  +  z2  2 

+  2  1 

V  +t 


If  T (2)  is  set  equal  to  a  critical  value  the  resulting  equation  is 
of  4**1  degree  in  z.  That  is,  we  could  have  four  values  Zy  z^,  Zy  and  2^ 
such  that  "  (z.,,  z2)  U  (Zy  z^)  ox  TT  -  [(z^  z2)  U  (Zy  z^)  ] . 

When  there  are  three  different  eigenroots  equating  T(z)  co  a  critical 
value  produces  as  many  as  6  critical  z's.  Then  the  set  could  have  the 
form  (z^,  z2)  U  (z3»  zj  U  (z5»  zfi)  or  T T-  {{Zy  z2)  U  (Zy  zj  U  (zg,  zfe)  ] . 
Likewise,  when  ihere  are  r-1  different  eigenroots,  there  can  be  as  many  as 
2(r-l)  different  critical  z's. 

Consider  the  following  example. 

Example  1.  Let 

2  1 
0  ■lir«3|t«5iK1*5  jy2*1; 

Ml  •  -i-<l,  -1,  0}  j  Hi  •  — <1,  1,  >2)  s 

_1  /2  ’2 

x,j  »  x,j,  ,  for  all  j,  j’ 

That  is,  this  is  an  example  of  the  special  case  of  equal  covariate  block 

means. 

2&  -  3/r  +  3/2  Jl  \ 

12  '12  '  ‘  3  / 

i.  ♦  2/2  V?  -  3/f  1 

s  •  \  l  *  6  •  -  3  / 
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Let 


Tx(z) 


ec'^)2  +  2z(j?'Ul)  (|'Ul)  + 
.2z2  +  .2 


2 

z  -  z  +  .25 
2 

. 2z  +  .  2 


and  let 


V») 


+  +  <SV  z2  +  2Z  +  1 


z2  +  .2 


z  +  .  2 


Then 

T(z)  =  T1(z)  +  T2(z)  .(x2  aJo2 

where  X  »  0  when  ic  +  z§  »  0  since  a  -  1  . 

The  functions  T^  and  each  have  the  shape  of  a  T(z)  function  with 
identical  roots  as  previously  discussed;  but  their  sum  does  not  (see 
Figure  I).  (The  illustrations  with  Roman  numerals  appear  in  Appendix  II.) 

With  a  significance  level  of  .10,  i.e.,  a  =  .10;  Qrj,  =  U  -  (-.53,  -.31) 
-  (.61,  2.4).  The  ot)  =  .10  critical  line  for  is  shown  on  Figure  I  also. 
The  Tj^(z)  fcnd  T2(z)  cruves  can  be  individually  compared  with  that  line. 


CHAPTER  IV 


INTERPRETATION  OF  EXACT  TEST  RESULTS 

Not  only  is  it  difficult  to  solve  a  problem  for  in  this  special 
case  of  equal  covariate  block  means  when  the  eigenroots  of  the  covariance 
matrix  are  different,  it  is  also  difficult  to  interpret  the  solution. 

(Such  a  solution  is  even  more  difficult  to  find  in  the  general  case,  as 
will  be  seen  later.) 

In  an  attempt  to  interpret  fiT  ,  visualize  r  lines  with  intercepts 
and’ slopes  3^  -  Unless  a  set  of  r  straight  lines  have  exactly  the  same 
slope  they  will  become  "infinitely"  far  apart  as  |z|  approaches  «.  So  in 
deciding  hew  to  answer  the  question  "Are  block  effects  different  when 
2  ■  zQ?"  or  "Are  these  straight  lines  different  at  Z  *  z0?"'  0116  must  use 
the  magnitude  of  zQ  as  part  of  the  judgment  criteria.  The  larger  |z|  becomes, 
the  more  difference  between  the  lines  it  takes  to  be  "surprising." 

A  A 

T (0)  ■  tK V  will  be  recognized  as  a  test  statistic  for  "block  mean 
effects."  Likewise  T(«)  »  6'[R(I  -  ^  A'A)R]§  is  the  usual  test  for  block 
regression  effects.  It  is  essentially  this  test  to  \rfiich  Cox  [1]  refers. 

In  this  problem  T(»)  is  the  value  of  the  test  statistic  when  |z|  is  so 
large  that  the  fixed  effects,  or  intercept  effects,  are  completely 
"washed  out."  Of  course,  in  practice,  this  point  (Z  -  ®)  never  occurs 
because  the  upper  and  lewer  values  of  z  at  which  t(z)  is  meaningful  is 
limited  by  the  range  of  the  covariate  within  each  block  in  the  axperiment. 

If  T(»)  >  o  i.e.,  *  c  fly,  then  the  r  lines  in  question  have 

"different"  or  "significantly  different"  slopes.  7*he  r  lines  are 
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restricted  in  the  sense  that  the  sum  of  their  slopes  is  0  and  the  sum  of 
their  intercepts  is  0.  For  examples  with  4  lines,  see  Figures  d.  and  e. 


Figure  d.  Figure  e. 

In  Figure  d,  although  the  lines  do  not  meet  at  a  single  point  it 

would  not  be  surprising  to  find  an  interval  about  z^  in  which  block 

differences  cure  uncertain.  Whether  or  not  that  interval  would  include  the 

2 

point  z  =  0  would  depend  on  whether  or  not  T(0)  <  0  cr  ^  *  T(0)  will  be 

recognized  as  the  usual  test  for  the  k  effects.  Figures  d  and  e  have  been 

drawn  with  the  same  < ^  values  so  that  if  T(0)  <  c<’Cr_1  in  Figure  d,  T(0) 

2 

is  also  less  than  a  C  ,  in  Figure  e,  and  it  would  be  logical  to  assume  an 

r-i 

interval  about  0  in  which  the  lines  would  not  be  considered  significantly 
different. 

2 

Similarly  if  T(»)  <  a  Cr_^  then  the  r  lines  will  have  approximately 
the  same  slopes;  end  since  the  slopes  are  restricted  to  add  to  zero,  the 
lines  will  all  h*>ve  slopes  very  close  to  zero.  5ee  Figures  f  and  g  for 
examples  with  4  lines. 
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These  cases  are  more  difficult  to  evaluate  subjectively  because  the 

effect  of  Kj  “washes  out"  as  j  z j  increases. 

2 

If  T(0)  >c  Cr_1  (perhaps  Figure  f)  then  there  would  probably  be  an 

2  2 

interval  about  0  in  which  T(z)  >  a  C  ,  .  However  if  T(®)  <  a  C  ,  and 

r-i  r-i 

2 

T (0)  <  a  (Figure  g)  it  may  be  difficult  to  understand  that  there  could 

still  be  an  interval  (in  Figure  g  in  the  region  z  <  0)  in  which  the  lines 
could  be  considered  significantly  different.  If  this  is  so  it  is  because 
the  lines  spread  apart  faster  than  we  would  expect  with  regard  to  |z|  .  Note 
that  the  lines  are  farther  apart  at  -6  than  at  +6  in  Figure  g. 

Scale  has  intentionally  been  omitted  from  these  figures  to  force 
subjective  thinking  rather  than  mathematical  manipulation.  Certainly  any 
decisions  must  take  scale  into  account.  The  purpose  here  is  to  suggest 
that  the  logical  systems  for  evaluating  whether  a  set  of  lines  are  appreciably 
different  at  Z  =  z^  relative  to  [zg|  give  rise  to  single  intervals  of 
difference  (as  in  Figures  f  and  g)  and  to  complements  of  single  intervals 
of  difference  (as  in  Figures  d  and  e) . 


Stated  differently,  let  be  the  set  of  z's  for  which  a  set  of  lines 

is  considered  to  be  different  by  some  criterion  k.  If  ®  e  z'  1 
and  z”  i  then  we  would  want  z  ^  if  z'  <  z  <  z"  .  (Figures  d  and  e.) 
Ar*d  if  «  i  z'  g  and  z"  e  then  we  would  want  z  e  ^  if 

*'  <  z  <  z"  .  (Figures  f  and  g.) 


This  desire  for  8^  to  be  of  the  form  (z^,  z 2)  or  T T  -  (z^  z^)  arises 
from  placing  equal  weight  on  each  of  the  lines,  or  on  each  contrast  of 


the  lines  since  they  are  restricted  to  add  to  0.  And,  indeed,  is  of  this 
form  when  for  all  i  and  i'. 

However,  T(z)  does  rot  place  equal  weight  on  each  contrast  of  the 


Sj's  unless 


y . ,  for  all  i  and  i ' . 


Thus  nT  is  net  necessarily  of  the 


form  z2 )  or  U  -  ( z z 2) .  Through  the  matrix 

(I  -  —  J)R-1(I  -  —  A'A)  1R  1(I  -  —  J)  the  statistic  places  more  weight 
r  r  r 

on  the  contrasts  about  which  there  is  the  most  information.  For  clarifi¬ 


cation,  consider  again  Example  1  (page  18) . 

In  Figure  II  (Appendix  II)  ,  the  lines  v^  =  ;l 


v2  *  +  J^E — z,  and  v,  =  -  - - - —  z  are  plotted.  The 


2^6  -  3/2  .  Jl  +  3/2 

S  V1  “  12  6 
JS 

- - z  are  plotted.  The 


shaded  area  represents  T7  -  the  z  values  at  which  we  cannot  be  certain 

of  blodc  differences  testing  with  a  .10  significance  level.  Because  cf 

the  V  8.  =  0  and  S  < ,  =  0  restrictions,  the  3  lines  of  Figure  II  are  redun- 
L  3  L  J 


dant. 


A  A 

In  F-fgu-®  rzz,  the  two  unrestricted  lines  w^  *  k'v^  +  £ -  .5  1  s 
and  w„  =  k'v.  +  3'u«z  *  1  +  z  are  plotted.  The  region  of  U  -  is  again 

4  —  “2  —  “2  T 

shaded. 

In  Figure  III,  where  the  two  lines  are  perfectly  parallel,  the 
multiple  ft  - zone  is  particularly  enigmatic  as  an  answer  to  the  question 
"For  what  values  cf  z,  considering  | z|  ,  can  and  w2  be  considered 
estimates  of  zero?" 

It  is  understandable  that  when  0  is  in  J2T  points  on  either  side  of 
0  might  not  be,  due  to  the  "washing  out"  of  <  with  increasing  j z\  .  But 
that  even  larger  values  of  |z|  should  indicate  block  differences  is 
difficult  to  accept. 

The  result  is  due,  of  course,  to  the  difference  in  roots  causing 


more  enphasis  on  as  |z|  increases. 

The  unequal  weighting  reflects  the  fact  that  we  have  more  information 
about  x'Uj  and  B'u^  than  and  |'u2  .  Certainly  this  ability  to 

concentrate  on  the  qualities  about  which  we  have  the  most  information  can 
be  a  desirable  property  for  a  testing  statistic.  But  when  there  is  equal 
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interest  in  each  of  the  contrasts  betwee..  effects  this  property  may  not  be 
so  desirable. 

In  terms  of  power,  T(z)  would  have  higher  power  against  alternatives 
2 

which  make  (< * p ^  large  than  against  alternatives  which  make 

2 

(<'^2  +  1 P 2 ^  lar9e'  whereas  equal  power  against  both  of  these  types  of 

alternative  hypotheses  would  be  desirable. 

If  we  could  control  the  covariate  Z  we  would  have  equal  information 
about  each  of  the  contrajts.  When  we  don't  have  equal  information,  i.e., 
equal  roots,  we  must  consider  whether  we  prefer  a  more  exact  answer  to  a 
question  we  are  only  approximately  asking  or  a  more  approximate  answer  to 
the  exact  question  we  are  asking. 

Recall  that  when  we  first  3 ought  a  statistic  to  evaluate  block  diff¬ 
erences  (page  11)  the  first  choice  was  oae  with  ejqsectation  dependent  on 
(k  +  z^g) ' (k  +  zQS) ,  and  T(z)  with  expectation  dependent  on 
(<  +  zg) ' [£  I  +  z2(I  -  —  J) M(I  -  —  J) ]  1(k  +  zg)  was  selected  only  because 
the  exact  distribution  of  T(z)  is  tabled. 

Let  us  look  at  a  statistic  with  expectation  dependent  on 


(k  +  zg)  '  (k  +  zg) . 
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Although  S*(Zq)  is  not  recognizable  as  any  single  distribution*  it 

o 

can  be  decomposed  as  a  sum  of  independent  x  statistics.  Let 


[H  +  *0L1'[H  +  zql]  -  l  5^^' 


where 


1*1  ■  1 
-i-i  A 


iiii."  0  i  ¥  i’  , 

being  a  vector  with  rt  elements.  Note  that  the  rank  of  H  +  zQL  is  r-i 
so  only  r-1  of  the  rt  eigenroots  are  nan -zero. 

Me  know  that  the  non-sero  roots  of  (H  +  zql) 1 (H  +  zql)  are  identical 
to  the  non-zero  roots  of  (H  +  xQL) (h  +  zQL) 1  .  And  (H  +  zQL) (h  +  zQL) '  « 
(I  -  i  J)  1  +  z*(I  -  j  J)M(I  -  J  J)1  (1  -  *  J)  ,  which  we  recall  from 
T(Zq).  In  the  consideration  of  T(z^  we  let  (H  +  zql)  (h  +  zQL) '  ■ 

l  (£  *  V.^KiEi  *  H*nc*  ***•  *  *oYi*  '*  to  th* 

1  2 

set  of  ^'s  and  we  Shall  v,9  the  notation  —  +  zQyi  since  it  shows  the 
dependency  of  the  roots  m  zQ  . 

Therefore* 


8*<«o>  -  x'  I  t<£  ♦  Vi^i-l1*  "  l  (t  *  ■oYi>*'Wi* 


where 


■jn'tiii*  '  • 

C  l 

2 

That  is*  each  of  these  r-1  terms  is  proportional  to  a  x  statistic  with 
one  degree  of  freedom  and  all  are  independent.  That  ■  o  under  R0*  the 
hypothesis  that  k  ♦  s0g  ■  0*  follows  from  the  "oentrality"  of  8* (sQ) . 

That  is,  E’tJ  Mi-i1^  *  0  H’liiiE  •  0  • 
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Being  a  weighted  sum  of  independent  x  statistics  under  HQ,  the 

distribution  of  S*(z^),  under  HQ,  can  be  approximated  by  fitting  the  first 

2 

two  moments  to  a  multiple  of  a  x  statistic.  Since 


-  r-1  .  _ 

E  (S*  (zQ)  ]  —  cf  l 
i 


S*(z  )n(z  ) 
U  0 


J 


■  n(zo> 


where  n(zQ)  is  the  nuntoer  of  degrees  of  freedom  of  the  x  statistic  which 
is  being  approximated.  Then  we  wish 


or 


Var 


S*(z0)n(z0) 


r-1 


o2  l  <±  *  V0> 


*-  i=l 


=  2n(zQ)  , 


Now 


[n(z0)l 


TTT. - FP  Var  S*(Z0)  "  2n(z0}  * 


Var  S*(zQ)  *  l  Varty^Q  +  =  l  (-izo  +  t**2®4 

,  4  rrX  2,2  2  rrX  r-l"] 

“ 20  L7*o  \  Yi + 1  zo  \  \  +  -prj 

r;:1  2  12 

[  (Vo  +  ¥> 

*  fw  :  7T2 

l  (Yiz0  +  t* 

_  1  _J 


Hence 


where 


and 


Let 


and  since  there  are  r-1  all  greater  than  C 


0  <  6  <  (r-2)v2 
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Hence , 


n(zQ)  <  r-1 


n(z  )  >  r-1  ^ - ij 

Y  +  (r-2)y 


n(z0)  =  r-1  when  zQ  =  0  or  when  6=0.  6=0  corresponds  to  the  case  ot 

equal  eigenroots.  Let 


S*<20> 
S(V  -ir-T 
V  +  t 


1  .  n(V  •  2 

Xn(zn) 

a  0 


S(Z0)  =  T(z0) 


when  y  =  =  Y  for  all  i  and  i'  . 

When  the  roots  are  equal  both  S(zQ)  and  T(zQ)  give  the  same  result. 
When  the  roots  are  different,  T(zQ)  becomes  very  cumbersome  and  can  yield 
results  which  are  difficult  to  interpret.  When  the  roots  are  different 
S(zQ)  condensates  by  varying  the  critical  point,  the  amount  of  condensation 
dependent  on  £  Y  ^  and  }'  y?  which  are,  respectively  the  traces  of  the 
matrices  (I  -  ~  J) R  *(I  -  ~  A*A)  ^R  *(I  -  ~  J)  and 

[(I  -  -  JiR'^t  -  -  a,a)~1r"1(i  -  -  J)]2  . 

r  r  r 

In  order  to  discover  the  nature  and  extent  of  the  s{z)  statistic's 

compensation  for  different  roots,  it  will  be  necessary  to  examine  the 

r-1  v2 
n{zQ)  nU^) 


function 


CHAPTER  VI 


ANALYSIS  OF  APPROXIMATE  TEST 


The  function  S(z)  has  exactly  the  same  shape  characteristics  as  the 

function  T(z)  when  the  roots  are  identical.  That  function  is  discussed 

fully  in  Chapter  III.  However,  the  critical  function  for  T(z)  with 

2 

identical  roots  is  a  straight  line  with  ordir’te  a  Cr_^  a  where  a  gives 

the  significance  level  of  the  test,  whereas  the  critical  function  for  S(z) 
2  2  r-1 

is  the  function  a  K  .  .  =  a  ,  .  C  .  .  .To  analyze  this  critical 
n(z)  n(z)  n(z),a  1 

function  let  us  consider  the  variation  of  C  with  n  .  The  functional 

n,a 

notation  showing  that  n  is  dependent  on  z  will  not  be  used  here.  From 

the  previous  chapter  it  should  be  clear  that  n  is  dependent  not  only  on 

z  but  on  r,  t,  and  the  values  of  the  covariate  X  used  in  the  experiment. 

At  this  point  attention  should  be  focused  only  on  the  variation  of  C  „ 

n,a 

with  n  regardless  of  how  or  why  n  might  be  varying.  The  notation  "n(z)M 

will  be  resumed  later. 

C  is  defined  by 
n,ot 


J 


n,u 


,  2,  .  2 

g(xn)dx 


1-ct 


The  curves  —  C  versus  n  aru  plotted  in  Figure  IV  for  seven  different 
n  n 

significance  levels.  When  a  is  small  —  C  decreases  with  n  .  When  a  is 

n  n 

very  large  —  C  increases  with  n  .  When  a  -  .25  the  function  —  C  is  very 
n  n  n  ii 

nearly  a  straight  line.  (Actually  it  is  concuve  downward  with  its  maxima 


at  n  -  2  .)  Now  if  —  C  could  be  well  approximated  by  a  constant,  as  is 

n  n 
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true  for  a  -  .35  and  for  other  significance  levels  in  certain  ranges  of  n, 

finding  critical  points  for  S(z)  would  be  very  easy.  Note  that  the  critical 

2  2  r-1  2  1 

function  for  s(z)  becomes  a  K  -a  -  C  ,  =  a  C  ,  because  if  —  C  is 

n  n  n-1  r-1  n  n 

constant  for  all  n  we  may  use  n  =>  (r-1)  and  it  is  equal  1"  C  ,  . 

r~J  jl—x 

Hence  if  the  desired  significance  level  for  *  a  test  is  .25,  as  might 

be  the  case  if  the  more  serious  testing  error  is  to  fail  to  recognize  that 

the  blocks  are  different,  that  is,  Type  I  errors  are  tolerated  to  decrease 

the  probability  of  Type  II  errors,  then  S(z)  can  be  evaiua  ad  against  the 
2  2 

straight  line  a  C  ,  .  Where  S(z„)  >  a  C  .  ,  the  point  z„  car.  be  considered 
r-1  0  r-1  0 

2 

to  belong  to  the  set  ,  and  where  S(zQ)  <  a  ,  the  point  zQ  can  be 

considered  to  belong  to  the  complementary  set  U  -  ft  . 

Unfortunately  the  function  ^  is  not  so  well  approximated  by  a 
constant  for  small  values  of  a,  and  when  n  is  small  the  variation  can  be 

considerable.  To  evaluate  the  amount  of  error  introduced  bv  using  ~r  C  . 

r-1  r-1 

rather  than  the  correct  —  C  .  Note  that 

n  n 


(r-1)-  C  = 
n  n 


a2  (r-1) 


r-1 


Cr-1  + 


o2  (r-1) 


pc 

In  n 


1 

r-1 


For  small  values  of  a  the  quantity  in  brackets  is  always  positive 

since  n  -  r-1,  and  it  increases  as  n  decreases.  To  find  any  bound  on  this 

error  it  is  necet -ary  to  find  out  how  small  n  can  become. 

From  a  theoretical  standpoint  it  should  be  pointed  out  that  a  values 

of  .50  or  .'r'  will  make  the  quantity  in  brackets  negative,  but  the  curves 

are  -.til)  mono  ton  i  c  and  the  maximum  error  would  occur  when  n  is  at  its 

minimum  valu-.  From  a  practical  point  of  view,  it  is  difficult  to  imagine 

any  ».i ♦  -  tat i->n  with  «uch  a  large  :<  where  one  would  not  satisfied  to 
r-1 

aptaoxtmaf,.  -  ny  c  . 


32 


In  Chapter  V  it  was  shown  that  n(z)  is  bounded  below  by  1  .  This 
lower  bound  is  not  dependent  upon  the  r,  t,  and  x^  s  of  the  experiment. 
When  whose  factors  are  incorporated,  a  much  higher  lower-bound  for  .i(z) 
can  be  obtained.  Ignoring  the  practical  limitation  which  the  x_'s  place 
on  z,  a  lower  limit  for  n  can  be  achieved  by  letting  z  approach  infinity 
since  n(z)  is  a  monotonic  decreasing  function  of  |  zj  .  Note  that 


2-  12 

—  *  ■ .  T.  ..  (z  y  +  p)  -2 

Limit  ,  ,  Limit  ,  t  y 

I  n(z)  =  |  |  (r-1)  — - - ” - —  =  r-1  — • 1 - 

z  •>  «  z  -v  «  .2-  12  4  -2 

1  11  (z  y  +  — )  +  z  6  y  +  6 


-2 

( r- 1 ) ^ — 


That  is 


n  (“)  = 


where 


C“X 

V  y ■  -  tr ( I  -  —  J)R“1(I  -  -  A,A)_1R_1(I  -  - 
k  i  r  r  r 


l  =  tr [ (I  -  X  J ) R~ 1 ( I  -  X  a’A)"1r"1(I  -  X  j)  ] 2 


Hence, 


1  <  n(°=)  <  n(z  )  <  n  (z)  <  r-1  , 

P 


where  z^  is  a  practical  upper  bound  on  J z|  imposed  by  the  values  of  x„ 
used  in  the  experiment. 

Referring  again  to  Figure  IV,  with  small  a  the  function  ^  is  a 
monotonic  decreasing  function  of  n(z).  So  since  niz)  is  a  monotonic 
decreasing  function  of  |z|,  ';he  critical  function  for  S will  be  a 


monotonic  increasing  function  cf  z 


I  CD  > 
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The  S  (z)  curve  will  be  shaped  as  in  the  equal  eigenroot.  case  of  T(z), 

2 

however,  the  critical  curve  for  S(z)  which  shall  be  called  o  K  ,  .  is  not 

n  (z) 

2  2 

a  straight  line,  as  is  a  ^  .  The  curve  a  is  symmetric  and  when 

a  is  not  large,  it  is  concave  upward  being  asymptotic  to  the  horizontal 


line  represented  by  a  .  See  Figure  i. 


0 


2.  -> 

Fi gure  i . 


It  should  be  pointed  out  that  the  critical  curve  is  always  symmetric, 

bounded  and  asymptotic  to  the  line  corresponding  to  n(°°)  ,  but  its  shape 

need  not  be  that  of  Figure  i.  The  only  exception  to  its  being  concave 

upward  (small  a)  or  concave  downward  (very  large  a)  is  the  case  of  a  =  .25  . 

In  that  case,  the  curve  will  be  bounded  above  by  where  b  is  the  maximum 

of  2  and  n(“>).  In  other  words,  when  using  a  =  .25  if  n(°°)  <  2  find  an 

upper  bound  for  the  critical  function  with  2  rather  than  n(“). 

2 

Since  the  critical  cvrve  based  on  a  x  statistic  is  virtually  a 

2 

straight  line  when  «  =  .25,  the  point  may  seem  trivial.  However,  when  0 
is  not  known  and  use  is  ’nade  of  a  F-statistic,  the  shapes  show  a  little 
more  variation  as  the  degrees  of  freedom  for  error  change.  The  x"  statistic 
discussed  here  is  equivalent  to  the  F-statistic  situation  witli  infinite 
degrees  of  freedom  for  error.  Obviously,  unless  the  decrees  of  freedom 
for  error  art:  very  small,  t.ne  situation  will  be  similar  to  that  illustrated 


in  Figure  i .  '"he  situation  Is  discussed  in  Chapter  VI 1 1  and  is  only  brougni: 

up  at  this  tir»  to  warn  the  reader  that  the  critical  function  for  cr<  F  (?.) 

statistic,  though  bounded,  may  not  always  have  the  same  shape. 

Solving  for  the  points  and  r.,  in  Figure  i,  the  intersections  of 

a  b 

S  (z)  and  :s  K  .  ,  can  be  done  iterat  i  ve lv,  since  the  intersections  of  S  (z) 
n  (z) 

with  any  horizon!*  il  line  ire  found  easily.  That  there  might  be  more  than 


two  intersections 


cus  cussed  an  t.  ■  mpie  3  on  low . 


Consider 


example  orcblems  -Reconsider  Example  1,  previously 


discussed  in  Chapters  Tfl  and  IV. 


Example  1,  Figure  v. 


=  !  .  A.  -  .1" 


A'  -  *  —  C. 

a  ~  .if  L  .•  >  * 


t  Y  ^  ■“  -if 


v  2  26  -  13  13  4 


(<  +  *3)  '  (,c  +  u:'i  -  *  y1  -1-  ct'y,)"-  it-  *-  zf'v  )“  *  2z*  +  z  +  1.25 


V2>  = 


and.  in t 


(>-  "■  r-V'W\  ^  _  F  -  s  ,  .25 

2-1  ,  2  .  - 
::  V  »-  -  .6z  +  .2 


(c  +  z6;'v,v.;(k  +  z3)  2 

-  -2“3  -  ”  Z  +  <cZ  + 


2-  .  1 
Z  V  + 


z  +  2z  +  1 

•  6z^  +  .2 


JO  tl.al' 


Sui)  •*■ 


(«,.§)  Ms*  .8)  2b^  ...  1.25  . . 

- - - - - S  C)  ♦  s  C) 

z  y  +  7  .6z  +  .2 

W 


i  2-  s. 

(Z  Y  *  y) 


2  2 

(.6z  +  .2) 


^  (a?.;  •  A 


2  2  4 

l.ti!  +  .2)  +  .16* 
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a!2L.  ! 

r-l 


n(°°) 

r-l 


(.6T 


(.6)' 


.16 


.36 

.52 


.693 


n(±  1)  =  .64 

r-l  .64  +  .16 


n  (±  .  5 ) 
r-l 


.1215 

.1215  +  .01 


.925 


C  ,  . 
n  (<») 

Cn(±l) 

Cn(i.5) 

Cn(0) 


°2 (.693)  =  Cl. 386  ^  3,4 


C2(.8)  =C1.6"  3-8 


C2  ( .925)  C1.85  ^  4,3 


C2  =  4.60 


K  /  X 

n(“) 

Kn(±l) 

Kn(±.5) 

K 

n(0) 


r-l 


-v  4.9 


n(»)  n(«) 

-£±.  C 

n  (±1)  n(±l) 

-J-1— ■  c 

n(±  .5)  n  (±  .  5) 


*  4.75 


*  4.65 


C  .  =  4.60 
r-l 


The  component  functions  S^(z)  and  S2(z)  are  graphed  along  with  S(z) 

in  Figure  V  only  to  allow  comparison  with  T^fz)  and  T2(z).  It  is  never 

necessary  to  break  S(z)  into  r-l  parts. 

Although  K  ,  .  ,  the  critical  function  for  S(z),  is  not  a  horizontal 
n  (z) 

line  through  the  point  (0,  C  j.) ,  the  error  incurred  in  such  an  assumption 
is  small  in  this  example.  Let 


..  2„  .  ,  2z’2  +  z’  +  1.25 

S(z’)  =  a  K  =  4.6  -  - 

n(0)  . 6z ‘2  +  .2 


z  » 


1  i  /2.0032 
1.52 


1.59,  -.27 


Let 


S  ( z" )  *02K 


=  4.9  a 
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n{») 


2z"2  +  z"  t  1.25 
.6z"2  +  .2 


z 


u 


1  ±  /2.015 

1.88 


1.29,  -.22 


With  only  two  calculations  we  determine  that 


(-.22,  1.29)  C  Rc  C  (-.27,  1.59) 


where  the  symbol  fi,  is  used  to  represent  the  set  of  z's  such  that 
2 

S(z)  >  ct  •  Whether  or  not  more  precision  is  justifiable  considering 

the  approximate  nature  of  the  distribution  of  S(z)  may  be  subject  to  debate. 

Jf  more  precision  is  desired  in  the  upper  end  point  of  the  interval,  one 

might  solve  for  K  , .  ..  and  S(1.4). 
n(1.4) 


n(1.4) 

r-1 


1.89 

1.89  +  .615 


.753 


°n(1.4)  C1.506 


3.67 


Kn(1.4) 


3.67 

.753 


4.88 


S (1. 4) 


2(1.96)  +  1.4  +  1,25 
1.176  +  .2 


4.78  <  K 


n(1.4) 


4.88 


and 


S (1.29) 


n{1.29) 


Hence, 


(-.27.  1.29)  C  ft -  C  (-.27,  1.4)  . 

O 


In  a  similar  manner  any  desired  degree  of  precision  can  be  attained 
with  regard  to  either  end  point  of  the  interval  of  the  z's  which  constitutes 


37 


Example  la. 

2  4  3 

Assume  now  that  y1  ■  j  and  y2  “  J  •  maintaining  the  y  ■  y  of 

Example  1,  but  having  a  ratio  of  Is 2  rather  than  the  more  extreme  1:5 

ratio  of  Example  1. 

Then  letting 


S(2)  -  0  cr-l  '  Cr-1 


yields  the  same  interval  (-.21,  1.59).  But 


.  10  9 

S  *  25  '  25 


JL 

25  ' 


so  that 


n(<*>) 

r-1 


.36 

.40 


=  .9,  C 


1.8 


4.226,  oK 


n  (°°) 


4.226 

.9 


=  4.7 


Setting 


S(z')  =  4.7  z'  =  -.256,  1.47  . 


And  if  z  <1  from  theoretical  or  practical  considerations,  we 
would  look,  not  to  n(°°)  for  an  inner  limit,  but  to  n(±l). 


°2k„(±1)  *  “■66  *  slz"> 


It  /I  +  1.0125 
1.592 


-  -.262,  1.52 


2 

And  due  to  z  <1,  we  would  state  only  that  z  values  greater  than 
-.26  or  -.27  result  in  significant  block  differences.  Greater  precision 
can  be  achieved,  of  course,  from  iteration. 
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It  has  been  assumed,  thus  far,  that  and  S(z)  intersect  at  only 

two  points.  It  should  be  clear  that  this  will  be  the  case  unless  the  two 
curves  ciose.lv  parallel  each  other  where  K  ,  .  has  the  most  slope.  The 
following  counter  example  is  given. 


Examole  1.  Picture  VII. 


|'6  =  19,  k'k  =  4,  k'§  =  0, 


10 +  4 


This  is  a  valid  mathematical  counter  example  to  a  contention  that 
'-  si:  be  of  single  interval  form,  however,  two  practical  points  should 
be  made.  First,  the  S  value  for  this  example  is  very,  very  high.  This 
mean  ;  that  the  roots  are  very  unequal,  possibly  5  roots  equal  to  .0925  and 
one  -cot  'vviL  to  1.54  .  with  this  much  difference  in  the  roots  it  is 
doub  Tul  that  the  approximation  of  the  distribution  is  very  good.  And, 
second’",  .tie  real  error  would  be  introduced  in  this  problem  by  reporting 
any  whatsoever. 


humi"  of  ht .  u  ' 'as*--  of  icna  Covariate  Block  Means. 

•#  . ,  /o’*  all  blocks,  j’,j  =  1,  2,  •••  ,  r,  to  simplify 

J 

c  ' — '  ci .  vr  have  found  two  tests,  or  criteria,  for  generating  sets, 
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of  z  values  for  which  blocks  are  significantly  different.  By  adding 
x  .  to  all  z  values  we  form  the  sets  of  all  x  values  for  which  blocks  are 

*  j 

significantly  different.  That  is,  if  .67  e  then  (.67  +  x^)  e  rT  . 

And  if  .67  e  fi_  then  (.67  +  x  .)  e  r  . 

S  •  j  s 

The  sets  and  are  found  using  the  statistic  T(z),  a  sum  of  r-1 

fractions  whose  numerators  would  add  neatly  but  whose  denominators  contain 

2  2 

the  number  y^  which  may  differ  from  term  to  term;  and  T(z)  ^  a  Xr_^  ■ 

The  sets  T  and  fi  are  found  with  the  statistic  S(z)  which  averages 
s  s 

the  y^'s,  substitutes  y  for  each  y^  and  adds  the  terms.  S(z)  is  approximately 
2  r-1  2 

distributed  as  a  — tt  X  /  .  »  where  n(z)  is  a  correction  factor  based  on 
n(z)  n(z) 

| z | ,  the  differences  between  roots ,  and  t. 

Let  us  speak  now  of  a  third  statistic,  R(z) ,  which  equals  S(z),  but 
2  2 

R(z)  a.  a  Xy.^  •  Naturally  this  approximation  is  rougher  than  the  one  for 

S(z)  since  we  know  that  the  first  two  moments  do  not  exactly  fit,  unless 

n(z)  =  r-1  .  Then  the  sets  Q  and  T  will  be  generated  by  R(z) .  The  use 

R  R 

of  the  R(z)  approximation  derives  as  a  simplifi cation  of  T(z)  by  the 
"unconscious"  mathematical  manipulation  of  averaging  denominators  to  add 
numerators.  Likewise  R(z)  derives  as  a  simplification  of  S(z)  by  letting 
n(z)  =  r-1  .  How  good  an  approximation  T  will  be  of  r  when  the  roots 

K  1 

of  (I  -  —  J) R  ^{1  -  —  A'A)  ^(1 - J)  differ  cannot  be  determined  with- 

r  r  r 

out  a  great  deal  of  calculation.  But  limits  can  be  placed  on  the  approxi- 

2 

nation  of  T  to  T  by  solving  z'  when  S(z')  ■  o  k  where  n  is  the  value 

R  S  H]^  In 

of  n(z)  which  results  in  the  maximum  (or  possibly  minimum)  value  of  Kn  over 
the  range  of  z.  (In  general,  the  limit  on  the  approximation  is  found  by 
letting  S(s')  ■  o2K  .  .  .] 

The  sets  rT,  rg,  and  TR  are  obtainable  from  fig,  and  0R  only  when 
x^  -  x^,  for  all  blocks.  Finding  Tg,  and  TR  in  the  general  case 
requires  sane  additional  calculation. 


CHAPTER  VII 


THE  GENERAL  CASE 


1.  Development  of  a  Parametric  Statement  of  the  Null  Hypothesis 
As  in  the  special  case,  the  experimental  model  can  be  written  as 

(x,  .  -  x  .)  (x.  .  -  x  .) 

y.  .  *  u  +  A.  +  k  .  +  8  . r .  — " — A3...——— .3. +  - i2 - J— —  +  e 

13  i  3  j  ]  r - J  3  - 7 

Vl(xij  “  *.j>  V^ij  “  *.j> 

1  -  1,  2,  •••  .  t,  j  -  1,  2,  ■■■  ,  t,  -  *  /  l  «t-  0  I 


however,  the  ic  ^  used  here  may  appear  to  be  different  from  the  of 
Chapter  II  in  terms  of  the  parameters  of  Chapter  I.  That  is,  here 
Kj  »  Vj  +  $jX#j  +  !^(x#j),  whereas  in  the  special  case  ic..  =  y.  + 

Hcwever,  since  the  special  case  uses  the  additional  inestimable  equation 
y  <t> .  =  0  to  solve  the  normal  equations ,  it  should  be  evident  that  the 

j  3 

previous  is,  indeed,  a  "special  case"  of  the  defined  here. 

When  x  *  xQ  the  variation  in  E(jr)  from  block  to  block  is  contained 

in  the  expression  k.  +  (x  -  x  ,)8,  +  (x  -  x  .  From  this  expression 
1  J  J  j  0  J 

we  can  eliminate  x  <i<,  since  it  is  the  same  in  all  blocks,  and  for  mathematical 
convenience  we  add  x>%<i»  .  Then  the  expression  incorporating  all  block 

differences  becomes 


+  (x0  ’  x.j)3j  "  (x-j  “  * 
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Let 


VV  "  (X0  ■  “  (X-H  "  X..)*  • 


-.j 


The  notation  Pj(Xg)  is  intended  to  imply  that  is  a  function  of  x^,  not 
a  product. 


By  taking  the  inestimable  condition  \  (xQ  -  xt  .)B .  «  0  to  distinguish 

j  *  3  J 


block  regression  effects  from  the  mean  regression  effect,  we  see  that 
r 


\  Pj(xQ)  =  0  •  It  should  again  be  evident  that  the  special  case  condition 


l  Sj  *  0  is  included  in  the  condition  J  P_j(xQ)  "  0  when  x. j  “  x. ji  for  a11 
j  and  j'  . 

A  A  o^l 

As  before  “  y,j  -  y, ,  and  k  <v  Nftc,  — (1  -  —  j)  ] .  Just  as  in  the 

special  case,  we  find  that 


|  +  *1  -  R_1d  -  ^  A' A)-1A^[I  -  i(It  8  Jr)X  . 


Then 


P(xQ)  -  (I  -  £  JHXgl-diagU.^Hg+liH  -  txQI -diagU.^JB-  [(diag  x.^-x,.!]!* 


and 


P(xQ)  -  (I  -  “  J) [XqI  -  diagtx.j)] (B  +  li)  , 


from  which  we  see  that 


:+P(x0)'V'Il|!c  +  P(x0),  o2(I  -  i  J>(£  1+  tXQl-diagtx.^jHfx^-diagtx.^ljtr-i  J)} 


where 


•1  l  -1  -l 
-  R  A(x  -  r  A'A)  *r  1  . 
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Let 


or 


=  (xQI  -  D)M(xqI  -  D) 


W  =  x“M  -  x [DM  +  MLl  +  DMD  , 
xQ  u  0 


2 

where  D  =  diag(x  .).  (Note  that  W  reduces  to  (xrt  -  x  .)  M  in  the  special 
*3  Xo  0  *3 

case . )  Then 


K  +  P(xQ)  V  N [k  +  P(xQ) ,  o2(I  -  ^  J)  (i  I  +  Wx  ) (I  -  -  J)] 
The  hypothesis  of  no  block  diffe  nee  when  X  =  becomes 


Ho!  <  +  P (Xg)  =  0 

vs.  H  :  k  +  P(x  )  ?  0 

3  “  ”  U  ~ 


Again,  an 


0  0 

where 


2.  An  Exact  Test 


2 

exact  x  statistic  can  be  found  to  test  this  hypothesis. 

♦  iiVi'lfi*  n  -  7  J)wxo(I  -  7J1'l'1‘s  *  ?lxo”  '  Xr-i,x 


X  -  ~~l<  +  P(x0)]'[^  I  +  (I  "  ~  J)Wx  (I  ”  ~  J)f  1[k  +  P(xQ)]  , 

2o  0 

and  X  »  0  under  the  hypothesis  HQ  . 

Unfortunately  when  (I  -  —  J)  (W  )  (I  -  —  J)  is  expanded  in  eigenvectors 

r  x0  r 

the  vectors  are  not  independent  of  x^  as  in  the  special  case.  To  emphasize 
this  point  the  subscript  x  will  be  used  for  the  roots  and  vectors.  The 
functional  notation  \r(x)  becomes  cumbersome  as  doc.  the  retention  of  the 
notation  xQ  which  has  been  used  to  emphasize  the  fact  that  the  test  is 


made  at  a  specific  point  and  then  the  inquiries  are  made  as  to  which  X 
values  would  cause  the  test  statistic  to  be  higher  than  some  critical 
function.  Hopefully  this  point  has  been  made  well  enough  to  allcw  the 
terminology  v^,  and  y^  to  adequately  communicate  the  idea  that  these 
roots  and  vectors  are  dependent  on  a  specific  value  of  X  at  which  the  test 
is  to  be  made.  Since 


(I .  i  ,)Vi  - 1  j) 


r-1 

l 


i=l 


y  .v  .  v' . 
XI -XI -xi 


then 

A  A 

[K  +  P(X)  ]  . 

It  should  be  apparent  that  it  is  possible  for  y^  =  y  , ,  for  all  i  and  i* 

and  for  all  x  values  only  in  the  special  case  previously  considered. 

2 

Hence,  solving  for  rT#  the  set  of  x's  for  which  Ty(x)  >  a  Cr_^  a 
becomes  a  trial  and  error  ta3k  which  generally  involves  the  inversion  of 
the  matrix 


r-1 


T^x)  =  -y  Ik  +  P(x)]'  l 


i-1 


v  ,  v' . 
-xi-xi 


1 

v  .  +  — 
Txi  t 


1 

t 


I  +  (I  -“■<!)  [>2M  -  x(DM  +  MD)  +  DMD]  (I  -  ^  J) 


at  each  trial. 

Recall  that  we  cannot  assume  that  r  is  of  the  interval  or  interval 
complement  form  so  that  finding  two  end  points  of  intervals  in  does  not 
mean  we  have  found  all  of  . 

Obviously  the  Ty statistic  has  the  theoretical  disadvantages  (resulting 
from  the  uneven  weighting  of  contrasts)  that  the  t  statistic  was  found  to 
have.  But  in  addition,  the  determination  of  TT  will  be  tedious  and  expensive 
If  there  was  reason  to  seek  a  second  test  in  tire  special  case,  there  is  even 
more  reason  in  the  general  case. 
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3.  An  Approximate  Ter.*: 

Again  an  approximate  cert,  for  the  hypothec  .•  .4 

H0:  K  +  ^(XQ5  0 

VS  .  Ha:  <  +  P{}Cq)  7^  0 

can  be  formulated,  which  depends  on  [k  +  P(Xq)]'[k  +  P(x^)1. 

Let 

sr(x0)  =  Ik  +  PCXqJ  1 '  [k  +  P(xQ)]  =  v'fH  +  L(x0)3'[H  +  Lfx^]^ 
where 

H  =  i  .1  9  (I  -  -  J) 
t  -  r  r 

and 

L(xQ)  =  (I  -  j  J)  (xQI  -  D)MRA^(Irt  -  [It  0  ijrl)  . 

2 

T^(xq)  is  a  central  x  under  the  null  hypothesis.  S* (xQ)  is  not  a 

2 

X  but  it  is  "central"  in  the  sense  that  y'[H  +  L(x0)J'[H  +  L(xq)]ij  = 

[k  +  P(Xq))'[k  +  P(xQ)l  s>  0  only  when  kt  +  F(xQ)  ~  0  and  again  U  =  E(j[). 

Like  o*(z)  the  statistic  S*(x)  can  be  decomposed  as  a  sum  of  indepen¬ 
dent  x“  statistics.  Let 

r-J 

[H  +  L(xrt) ] ' [H  +  L(x_) ]  -  I  K  J ■  .4*. 

1  0  0  L.  xi-xi-xi 

l-l 

where 


l  ,,  -  1 
-xi-xi 


-  0 


i  -  i » 


i  ¥  i' 
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or 


m  (x)2o  l  (Yxi  +  £) 

7  ll2 

0  |i  Yxi  +  rj 


2m(x) 


Thus 


m(x) 


“r-l  , " 

\ 


r-l 


4  2 


1.2 

r  -  l  2 

(\  *  t 

=  r-l 

<Yx  +  t> 

1.2  »  -2. 

-  1. 2 

_(YX  *  *  <rx  -  Yx)_ 

_(Yx  *  t >  *  5x. 

where 


1  r_1 

Tx*pr  ?  T*i 

i 

r-l  „ 


1  r 

Yx  r-l  \  Yxi 


hence 


Let 


.  -  -2 
6  =  Y  -  Y 

X  'x  X 


0  <  &x<  (r-2) , 


1  <  m(x)  <  r-l  . 


'  X  t 


then 


4  *vx)  ^ 


—  x2 

m(x)  m(x) 


recalling  that 
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[<  +  P(x)J'[K  +  P(x)  ] 


(1)  tyx)  - - - - J - 

Yx  +  t 

S^x)  ■  T^j[x)  only  for  those  x's,  if  any,  such  thaty^  =  Y^p 
all  i  and  1'  .  Now 

(r-l)y“x  -  trace  of(l  -  j-  J)Wx(i  -  j  J)  >  0 

■  trd  -  7  J)  [x2M  -  x (MD  +  DM)  +  DMD]  (I  -  7  J) 

-  x2  trd  -  -  J)M  -  2x  trd  -  -  J) MD  +  trd  -  -  J)dmd 
r  r  r 


Let 


where 


and 


where 


Let 


t  * 
*2* 


+  A^x  + 


X 

X 

X 


2 

1 

0 


trd  -  -  J)M 
r-1  r 

-  ~r  tr{I  -  -  J)MD 

—  trd  -  i  J)DMD  f 


P(x)  -  (I  -  ~  J)  (xl  -  D)Gy 

G  -  R’1(I  -  —  A'A)Al(I  -  7  (I.  «  J  ) J 
r  a  r  z  x 

P(x)  -  x(I  -  i  J)  Gjj  -  (I  -  ~  J) DGy  . 


ii  -  d  -  i  J)G* 


for 
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and 

i0  -  -«  “  7  J)DGX 

thus 


p(x)  »  n  x  +  nQ  . 


Then 

x2n^!  +  2xn^(n0  +  <)  +  (nQ  +  <)  •  (nQ  +  <) 

(2)  Sjx)  «  — = - - - - - : - ■— 

x  X2  +  xX1  +  (XQ  +  -) 

When  djltx)  is  expressed  in  the  first  form  (1),  it  is  difficult  to  see 
how  values  of  x  affect  the  value  of  S^x) .  However,  in  the  second  form  (2) , 
it  is  apparent  that  S^x)  is  again  a  ratio  of  two  quadratic  functions  in  x. 

Again  S^x)  is  continuous  and  has  at  most  two  extreme  points,  although 
those  two  points  can  no  longer  be  expected  to  lie  on  either  side  of  0  as 
with  S(z) . 

Looking  at  it  can  be  seen  that  S^x)  takes  its  "shape"  from  the 

A  A  e  A  A  A  A 

sign  of  Q  ■  +  rat^er  than  siitply  from  -  x'g  as  in  the 

special  case.  If  Q  >  0,  then  >  0  for  very  large  values  of  |x|  which 

oX 

implies  that  S(x)  has  a  maximum  to  the  left  of  its  minimum. 


Likewise,  if  Q  <  0, 
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And  if  Q  =  0,  and  >  — i-y  (n_0  +  <)  '  (nQ  +  k)  -  S^O) 

2  X0  +  t 


S/x) 


Similarly,  if  Q  =  0  and  S^")  <  S^O) ,  S^x)  is  again  symmetric  about  xc» 
but  concave  downward.  It  will  be  noted  that 


lim  i  y'G'  (I  -  r  J)Gy 

W  -  *f*>  -  s,<"> - T-1 — 


again  the  test  for  regression  effects  to  which  Cox  [1]  refers. 

However,  S(0)  is  not  the  test  statistic  for  fixed  effects.  If 
X  j  «  x#j,  for  all  j  and  j',  then  S^x^)  would  be  equal  to  S(0).  Thus 
t^x)  is  shaped  much  as  S(x)  ,  and  can  be  easily  solved  for  its  two  inter¬ 
sections  (if  any)  with  any  horizontal  line.  The  general  critical  function 

°2  Sr  *»<«>  U  not-  ho”e",r'  *ha)?e'J  “  0,8  critical  tmctim  f°r  the 

special  case.  In  the  special  case,  n(z)  decreased  monotonically  with  |z| 

so  that  when  —  C  was  monotonic  with  n,  the  critical  function  was  monotonic 
n  n 


with  I z | 

Recall  that 


Here  the  functions  —  C  are  the  same,  but  m(x)  is  not  unimodal. 

n  n 


2 

where  £  y ^  and  £  y^  are  the  sum.  of  roots  and  sum  of  squares  of  roots  of 
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the  matrix  (I  -  ^  J) [x2M  -  x(MD  +  DM)  +  DMD) (I  -  j;  J) .  These  roots  will 
vary  with  X  and  certain  X  values  will  produce  more  unbalanced  roots  than 
others.  In  order  to  evaluate  the  variation  of  m(x)  with  X  it  will  be 
necessary  to  rewrite  m(x)  in  a  form  which  incorporates  the  functional 
dependence  of  the  roots  on  X.  This  can  be  done  by  noting  that  the  sum  of 
the  roots  is  the  trace  of  the  matrix  and  the  sum  of  the  squares  of  the 
roots  is  the  trace  of  the  square  of  the  mitrix. 

— r  J  y  ,  =  y  =  +  X,x  + 

r-l  L  'xi  'x  2  1  0 

where  the  AQ,  and  are  defined  for  establishing  the  second  form  (2) 
of  S^x) .  Now 


Thus 
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where 

^4  3  r-1  tr[(I‘r  J)M(I  "  r  J)M]  ^3  “  "  r^T  tr[  (I  “  r  J)M{I  “  r  J)MD] 

{tr{  (I  -  —  J)M(I  -  —  J)DMD]  +  tr[  (I  -  -  J)MD(I-  -  J)  (MD+  DM)]} 

i  r- 1  r  r  r  r 

tr[(I-i  J)MD{I-  i  J) DMD]  if>Q  =  -~y  trt  (I  J)DMD(I  -i  J) DMD] 

Then 


[A2x2+X  jttA  +~]2  where  g(x)  -  (^-aJ)x4^,-2X, XJx3 
m(x)  »  r-1  - -  42  3  12 

U2x  +A1x+Xq+^]  +g(x)  +  (^2*X2Xo'Al)x2+^l+XlVX+%+A0) 

We  note  that  m(x)  is  the  ratio  of  two  expressions  which  are  4^  degree  in  X. 

As  such  it  can  have  four  distinct  critical ,  or  extreme,  points.  It  is 
bounded  above  by  r-1  but  it  is  difficult  to  establish  a  useful  lower 


bound.  Since 


r-1  2  /r-1  %2 

{  yxi  <  1  1  yxl }  ,M  “y 


x,  we  knm  that  m(x)  >  1  .  It 


was  shown  in  Chapter  VI  that  a  ■  .25,  the  critical  function  is  insensitive 

to  changes  in  m(x)  and  the  lower  bound  m(x)  «  1  may  be  adequate  to  bound 

m(x)  and  hence  bound  C  .  .  and  hence  bound  T  ,  the  set  of  x’s  for  ,'»hich 

mix)  mix)  s 

blocks  are  significantly  different.  But  when  a  is  small  and  there  are  a 
large  number  of  blocks,  the  m(x)  -  1  bound  may  well  be  too  low  to  be  of 
value.  The  m(x)  -  1  result  is  equivalent  to  the  extreme  case  of  all  roots 
equal  to  0  except  one.  That  any  value  of  X  could  cause  this  sort  of  dis¬ 


tortion  o£  the  matrix  becomes  more  and  more  unlikely  as  r,  ti.e  size  of 
the  matrix,  becomes  larger. 
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This  should  not  be  surprising  since  as  x  becomes  increasingly  large  any 
x  .  subtracted  from  it  would  become  immaterial. 

*  j 

Hence,  m(x)  is  asymptotic  to  m(~)  and  the  corresponding  critical 
2  2  r-1 

function  is  bounded  by  a  C  ,  and  asymptotic  to  c  ~~r-r  C  .  ,  .  The  function 

r-1  m(“)  m(“j 

m(x)  can  have  a  minimum  lower  than  m(“) ,  though  it  will  be  greater  than  1. 

A  great  deal  of  effort  can  be  expended  finding  the  intersection  of  the  $^x) 
function  and  the  critical  function.  Consider  an  example. 
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♦4 " 

.0452 

il/  -  X2 
V4  2 

= 

.00174 

*3- 

-.0174 

*3  “  2X1X2 

m 

-.0087 

*2“ 

.0318 

<l>2  -  2A2A0  +  A2 

ts 

.0163 

h- 

-.00232 

*1  *  2X1X0 

s 

-.00073 

*0- 

.00163 

-  A2 
r0  0 

m 

.00017 

6  *  Y  -  Y2  -  .00174x4  -  .00864x3  +  .0163x2  -  .00073x  +  .00017 

X  'x  'x 


m(x) 


(.208x2  -  . 0208x + .2043) 2  +  .00174x4-  .00864x3  +  .0163x2  -  .00073x + .00017 


The  function  m(x)  is  plotted  in  Figure  VIII  (Appendix) .  We  see  that 
m(O)  is  very  cloe>»  to  r-1,  that  is,  2  .  This  is  because  the  x.^'s  °*  the 
example  sum  tr  0  .  The  curve  is  asymptotic  to  1.92  .  However,  the  minimum 
value  of  the  curve  occuni  between  -1  and  -2  .  This  means  that  X  values 
between  -1  and  -2  cause  the  roots  of  the  matrix  to  be  relatively 
further  apart  than  any  other  X  values.  Such  a  plot  may  be  disconcerting 

but  its  effect  on  the  problem  at  hand,  i.e.,  the  finding  of  I*  ,  can  be 

“Sr 

vary  small. 

r-1 

In  Figure  VIII,  above  the  m(x)  curve,  are  the  cm(x)  curves  for 
a  -  .05  and  a  ■  .10  .  Since  a2  is  taken  equal  to  1  in  this  example,  these 
are  the  critical  curves  for  evaluating  fyx)  at  the  .05  and  .10  levels, 
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respectively.  When  ra  =  .10  the  total  range  of  the  critical  function  is 
.012  units ,  and  clearly  it  would  be  much  less  for  a  -  .25  .  When  a  =  .05, 
the  range  is  .24  units.  Even  if  a  =  .05,  this  slightly  undulating  critical 
function  may  not  cause  appreciable  trouble. 

Consider  the  following  values  for  the  estimates  of  the  parameters 
which  comprise  S^(x)  .  Let 


then 


S^x) 


x2n^nx  +  x[2n^n0  +  <n  +  (nQ  +  <)'(n0  +  <) 

x2X2  +  *X1  +  *0  +  £ 


and 


S7(x)  = 


„  2  2 
2x  -  2x  +  - 


.  208x  -  . 0208x  +  .2049 


in  Example  4  (see  Figure  IX) . 

As  a  first  step  in  finding  Ts  we  find  Fr  by  equating  S^x)  to  a”cr_1 
for  a  •  .05  . 


2~ 


S^(X' 


)  »  5.992  - 


2x’2  -  2x*  +  .667  _ 

. 20  3x' ^  -  ,0  2  80x*  +  .20  49 


(Recall  that  r-1  ■  2  .) 


x' 


2.76,  -.27 
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Noting  that  &J.Q)  <  5.99  and  .27  <  0  <  2.76  we  state  that 
T  »  T7  -(-.21,  2.76) 

R 


T_  is  an  upoer  bound  on  T.  in  the  sense  that  TCT  .  Mote  •’-hat 
R  a  S  h 

use  of  Fr  means  that  the  true  <x-T.evel  for  the.  test  is  more  than  the  stated 
.05  level. 

A  second  step  toward  finding  Ts  would  be  to  so.lvre  Sy(x")  -  (r-l'C^  . 
Again,  u  ~  .05  . 


2x"2  -  2x"  t  .667 _ 

. 208x"2  -  .0208x"  +  .2049 


7.68 


x"  *  5.05,  -.45 

r  Lower  Pound  .  .  ,  =  U  -(-.45,  3.05) 
m(x)=l 

The  fact  that  this  lower  bound  is  not  close  to  I’  means  that  it  will 

R 

he  necessary  to  find  m(-.27)  and  m{2.76).  ’.Cnowing  that  the  interval  found 

by  using  m(x)  =  1  is  a  very  poor  limiting  value,  we  can  expect  the  true 

set,  r  ,  to  be  close  to  T  . 

S  R 

Finding  the  values  of  m(-.27)  and  m(2.76)  requires  the  determination  of 
the^,  i and  values.  Had  they  not  previously  been  determined 
in  order  to  plot  Figure  VIII,  it  would  be  necessary  to  find  them  at  this 
time. 


x 


(-.27,  2.7-’) 
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This  result  would  oe  sufficient,  in  most  cases,  for  us  to  stat ; 

that  r  =  TT  -('.7.7,  2.  76)  . 
s 

Figure  IX  shows  the  critical  curve  for  Example  \  for  a  values  of 

.25,  .10,  .05,  .025,  and  .01  .  These  are  presented  in  order  to  show  the 

relatively  small  amount  of  difference  in  the  a-level  which  would  result 

from  using  r  as  an  approximation  to  T  .  However,  this  is  admittedly 
R  S 

only  one  example  and  a  great  deal  of  work  can  be  done  on  the  matter  of 

finding  a  realistic  lower  limit  for  the  critical  function.  It  must  be 

rementoered  that  if  the  eigenroots  of  the  M  matrix  are  all  near  zero  except 

2 

for  one,  the  critical  function  could  rise  very  nearly  to  o  (r-l)C^  . 

Hence,  it  would  be  very  desirable  to  find  a  quick  and  easy  lower  limit  for 
the  critical  function  in  terms  of  the  matrix  M  and  the  diagonal  matrix  D. 


At  this  point  it  can  be  said  only  that  the  use  of  the  critical 

2  2  r-1 

function  a  cr  ^  seems  to  give  very  good  approximations  to  a  cm(x) 


unless  the  M  matrix  has  only  one  root  appreciably  different  from  zero. 


CHAPTER  VIII 


UNKNOWN  VARIANCE 

■> 

When  the  error  variance  a*"  must  be  estimated  from  the  experimental 

data,  an  F-statistic  can  oe  formed  with  T^x)  and  MSE,  or  ar.  approximat ; 

F-statistic  can  be  formed  with  S^x)  and  HSE. 

Both  T^x)  and  S^(x,  are  independent  of  MSE,  since  he  matrix  of  the 

quadratic  form  of  SSF.  is  orthogonal  to  the  k,  6,  and  B  functions  of  the 

y„'s  from  which  Tbfx)  and  S^(x)  are  formed. 

o 

T^x)  ,  being  a  true  •/”  ^  ,  can  be  used  to  find  a  true  F-statistic, 

namely 


F  (x)  =  /T~ ^  v  F 

V '  MSE  r-1 ,  rt-2r-t+l ,  A 


where 


X  -  ~[k  +  P(x)]'[“  I 


+  (I  -  -  J)W  (I  -  —  +  P(X)  ] 

r  x  r  - 


Everything  that  was  said  about  Xjlx)  can  be  reiterated  for  F^fx)  . 
Unless  conditions  for  the  special  case  hold,  the  calculation  of  each  point 
of  Ft(x)  will  require  inversion  of  an  r  1  r  matrix.  The  power  of  FT(x) 
will  be  greater  against  some  specific  alternate  hypotheses  than  others,  as 
determined  by  the  eigenvectors  and  eigen roots  of  the  matrix  o;  the  quadrati 
form. 

At  the  cost  of  knowing  the  exact  distribution  of  the  statistic.  these 
faults  can  be  corrected  by  formulating,  with  :y(x)  ,  an  up?  roximate  central 


?8 
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F-statistic  under  the  null  hypothesis,  namely 


Ft.(x) 


MSE  m(x) ,q 


MSE 


whe  re 


q=rt-2r-t+l 


As  in  the  consideration  of  S^x)  ,  the  two  values  of  x  (if  any)  at 

which  F_(x)  equals  any  critical  value  of  an  F-statistic  can  readily  be 
s 

found.  However,  the  critical  function  for  evaluating  F^fx)  is  not  a 
horizontal  line  although  it  may  be  nearly  so  in  most  cases.  To  consider 
the  variation  of  the  critical  function  with  x,  it  is  necessary  to  consider 
the  variation  of  the  critical  points  of  a  F-statistic  with  v^,  the  numera¬ 
tor  degrees  cf  freedom;  and  the  variation  of  the  numerator  degrees  of 


freedom  -  m(x)  with  x 


Define  f  by  \ 
u^q  7  * 


fv 


F  =oi 


Note  that  C  =  f 
n  n, 


considered  the  variation  of  f 
that  f 

Vl  ,® 


We  have 

with  v  in  Chapter  VI.  It  was  found 


v1,°°  1 

was  a  monotonic  decreasing  function  function  of  v  for  small  a. 


and  a  monotonic  increasing  function  of  for  u  larger  than  .25  .  When 


a  =  .25  the  largest  value  of  f  is 

'1,® 


found  at  v,  =  2  . 


In  Figures  X,  XI  and  XII,  values  of  f  are  plotted  against  —  . 

vpq  vj_ 

It  will  be  noted  that  the  slopes  of  these  curves  are  "quite  constant,"  the 

variation  being  in  the  same  direction  as  before  when  y,  -•  q  is  not  small. 

When  is  1  or  2,  the  critical  function  actually  increases  with  .  The 

constant  slopes  of  these  curve'  indicated  that  linear  interpolation  should 

oe  done  with  ~  rather  than  with  v.  . 

V1  1 

Having  established  that  the  critical  function  f  r  '  , (xl  will  bo 
generally  monotonic  with  m(x)  ,  the  only  exception  beinr  with  a 
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large  number  of  degrees  of  freedom  for  error,  attention  is  again  focused 
on  the  variation  of  m(x)  with  x  .  In  the  special  case  of  equal  covariate 
block  means,  m(x)  is  monotonic  with  |x  -  x#  J  ,  so  that  limits  can  be 
placed  on  the  critical  function  by  m(^)  and  r-1  .  However,  the  same  prob¬ 
lem  of  finding  a  lower  limit  for  rn(x)  in  the  general  case  that  was  dis¬ 
cussed  in  Chapter  VII  occurs  here.  When  a  is  small,  the  critical  function 

f  ,  .  will  reflect  the  shape  of  the  m(x)  curve.  The  closeness  of  F ,  to 
m(x),q  e  S 

F  can  be  established  only  by  solving,  iterative] y,  for  the  intersections 
R 

of  F„ (x)  and  f  .  .  .  Consider  an  example. 

S  ir.(x)  ,q 

Example  4.  (Continued  from  Chapter  VII). 

2 

r  =  3,  t  =  6,  a  unknown,  MSE  =  .60,  a  =  .05 


2x  +  .667 
0208x  +  .2049 


Fs(x)  = 


_ 1 

2  (.6) 


\n(x)  ,rt-2r-t+l 


under  Hr 


fr-l,rt--2r-t+l  f2,7  4'74 

To  find  T  solve  S(x)  =  1.2(4.74)  =  5.69  for  x;  this  yields 
R  Y 

x  *  -.24,  2.57  .  Hence,  r  =  U  -(-.24,  2.57). 

R 

Next  find  m(-.24)  and  m(2.57)  . 


bl 


m(-.24)  = 


2 ( .2219) 


=  1.94 


•1.94,7 


(.2219)  +  .0014 

4.76 


S(x)  =  1.2(4.76) 


solving,  x  =  -.24 


r' 

|  m(2. 57) 


2 ( 1. 524) 2 _ 

(1.524) 2  +  .0353 


2(2.32) 

2.36 


f1.97,7 


4.75 


S^x)  =  1.2(4.75,  =  5.70 


solving,  x  =  2.55 


r 


s 


Then 


U  -(-.24,  2,56) 


CHAPTER  IX 


EVALUATION  OF  TIIE  TESTS 


It  is  desired  to  find  a  cast  for  block  differences  for  an  experimental 
model  which  has  block  effects  with  a  fixed  portion  added  to  a  regression 
portion. 

2  2 

In  looking  for  a  test  one  turns  naturally  to  a  x  test  when  o  is 
2 

known  or  an  F-test  when  a  is  estimated.  To  study  the  variation  of  the 
test  statistic  with  the  covariate,  the  matrix  of  the  quadratic  form  is 


expanded  in  eigenvectors . 


r-i  ft  *  rlxll'y^)  2 

rrr— ' Vl 

o  c  1=1  XI  t 


It  is  noted  that  the  numerators  of  the  fractions  would  add  nicely  to 

A  A  A  A 

[k  +  p(x)  ] '  [k  P(x)  ]  ,  but  with  unequal  y^'s  the  denominators  of  the 

fractions  such  an  operation  is  not  permissible.  Clearly  an  upper  limit 

on  T^x)  could  be  found  using  the  minimum  y  and  a  lower  limit  would 

result  using  the  maximum  of  the  y  . 's  .  This  assumes,  of  course,  that  the 

xx 

Y^'s  are  known.  The  testation  to  average  the  y^'s  would  occur  to  an 
applied  person  who  had  not  been  too  thoroughly  influenced  by  the  mathe¬ 
matical  quest  for  exactness. 

The  applied  mathematician  might  very  well  average  the  y^'s  to  g«t 

an  approximation  of  "Vx) .  Then  if  he  wore  asktd  "Does  your  approximation 

2 

of  Vx'  have  the  same  critical  point  (or  function)  j?"  he  fti^t  aay 
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"Only  if  the  y  .  's  are  exaci.lv  alike,  otherwise  1  hedge  a  little.  That 
xi 

is,  where  the  y  .  's  are  very'  different  it  takes  more  to  surprise  me  at  the 
xt 

sairi'  tf  -level.  However,  in  ail  honesty,  if  I  have  no  reason  to  believe  the 

2 

roots  are  very  different,  I  lust  qo  ahead  and  use  a  C  ,  . " 

r-1 

The  statistic 


S^x) 


Ik 


+  P  (x)  ]  '  +  P  (x)  ]  V 


r-1  ( 

m(x)  ^(x) 


where 


m(x) 


does  the  averaging  and  hedging  with  the  mathematical  justification  of 

fitting  a  multiple  of  a  central  ^  with  a  certain  number  of  degrees  of 

freedom.  With  this  justification  for  what  the  applied  mathematician  might 

do  instinctively,  the  test  is  found  tc  t-  insensitive  to  differences  in 

roots  when  x  is  large.  When  a  is  small,  checks  (finding  the  , 

y  ,  y  ,  and  hence  m(x)  values  defined  in  Chapter  VI)  are  tedious,  but  not 
3  4 

2 

nearly  so  tedious  as  solving  Tj.x)  =  o  ^  for  x  . 

The  expansion  of  Tyjx)  has  another  result.  Tt  shews  that  the  x) 
statistic  is  weighting  the  various  contrasts  of  the  block  unevenly  as  we 
have  more  or  less  in  format i on  about  thorn.  < -ranting  the  general  wisdom  of 
such  a  mow,  if  we  are  equal  lv  interested  in  all  of  Mv  blocks,  lyx)  is 
not  really  t •  •  1 1  i n*|  us  what  we  want  to  know.  That  is,  yx'  gives  on  exact 
answer  to  t,h>'  approximate  preb-h-m  at  hand,  b^x)  »  ■  si  t  (-.*•  other  hand,  gives 
on  approximate  answer  to  *  H--  <-xa-t .  prof  b-r  .,t  hand,  j  •:  mo  tv  in  line 

with,  the  desires  of  rhe  statistician. 
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The  T-^x)  test  is  uniformly  most  powerful  against  a  set  of  alternatives 
which  ray  not  be  of  particular  interest.  Of  interest  in  regard  to  this 
point  is  work  by  Reisch  and  Webster  [6],  Scheffe  [8]  and  Ksu  [4].  The  re¬ 
direction  of  power  offered  by  f^/x)  should  mere  than  compensate  for  the 
errors  incurred  by  the  approximate  nature  of  the  test. 

The  JV'x)  test  which,  is  the  S(x)  statistic  evaluated  against  the 
T  T 

critical  function  of  T-T x)  i  -  a  further  approximation.  Rjx)  is  a  very  good 
approximation  of  S^(x)  for  hign  u -levels  and  seems  to  also  give  good  results 
when  a  is  low.  I^(x)  has  the  further  intuitively  appealing  characteristic 
of  always  yielding  a  set  of  X  values  for  which  the  test  is  significant 
which  is  of  the  interval  or  interval-complement  form. 

The  test  R(x)  and  the  resulting  set  of  X  values,  i  p,  are  simple 
and  practical  tools  which  should  find  use  in  industry  and  in  the  social 
sciences.  They  are  well  suited  to  the  large  a  situations  in  which  one  is 
particularly  concerned  (worried)  about  not  recognizing  block  differences 
when  they  occur.  The  usefulness  of  these  tools  in  scientific  research 
would  be  increased  immeasurably  with  a  lower  bound  for  m(xi  when  the  co¬ 
variate  block  means  are  not  equal  and  with  some  study  into  the  closeness 
of  the  approximating  distribution. 

Although  this  work  has  been  developed  for  a  two-way  classification 
rode  1  with  a  covariate,  it  is  equally  valid  when  the  treatment  effects  are 
0.  Then  wo  arc  talking  about  comparing  simple  regressions  based  on  t 
replications  fror  t  different  sources  and  the  M  matrix  is  diagonal  ir.  that 
case. 

Work  is  currently  being  dene  l<v  the  author  to  extend  these  results  to 
con^'are  multiple  regression:*  from  different  sources.  Tn  these  cases,  the 
■t-typ**  statistic  addresses  itsclr  more  directly  to  the  i  r-  torn  at  hand  than 
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does  a  T-type  statistic,  and  it  will  greatly  simplify  the  problem  of 
finding  sets  of  points  for  which  the  test  is  significant.  However,  there 
may  be  a  nunfcer  of  other  cases  in  which  one  would  rather  not  accept  the 
unequally  weighted  contrasts  of  the  conventional  sum  of  squares  for  re¬ 
gression  test.  Armed  with  some  facts  about  the  closeness  of  the  approxi¬ 
mations  it  is  possible  than  one  might  wage  war  on  many  fronts  against  the 
wide-spread  use  of  0*(X'X)£  to  test  the  effects  of  0  when  ^  =  +  e  . 


APPENDIX  I 


THE  MATRIX  (I  -  -  J) (I  -  -  A'A)”1(I  -  -  J) 
r  -  r 

The  matrix  (I  -  7  J)  (I  -  —  A'A)  1(I  -  —  J)  where  A  =  [a.  J.  „  and 
r  r  r  ij  txr 


x.  .  -  y. 


ii  • i  .  ,  fa  m  2 

aij  =  — (qX)  —  where  (qx)  ^  "  x.  j)  occurs  in  this  work  as  a 

special  case  of  the  matrix  (I  -  —  J)M(I  -  —  J)  where  M  =  R-1(l  -  •-  A'A)_j'r"’1 


i,U 

and  R  is  a  diagonal  matrix  with  j  element  (qx)  ^  .  If  the  covariate  values 
are  controllable,  he  elements  of  R  can  be  expected  to  be  made  very  close 


to  the  same  values.  In  Cox's  example  [1]  the  covariate  values  in  each 


block  are  the  digits  1,  2,  3,  •••  ,  t  so  that  R  would  be  the  constant 


-  (t+1)  (t-1)  . .  . . 

- ~ -  times  an  identity  matrix. 


The  R  matrix  is  clearly  nonsingular  so  the  nonsingularity  of  M 
depends  on  the  matrix  (I  -  ^  A'A) .  Singularity  of  this  matrix  could 
result  only  from  A'A  being  of  rank  one.  This  follows  from  the  positive 

semi-definite  nature  of  A'A  and  that  its  trace  equals  r  . 

1  r  1  r 

Let  —  A'A  «*  T  iv.v'  ,  then  {I - A'A)  »  \  (1  -  X.)v.v!  ,  where 

jt  ,  1"*jl**i  r  , u  x  -  i-i 

i  i^l 

I  X.  -  1  and  0  <  X.  <1  for  all  i.  Hence  the  only  way  to  produce  a  0-root 
i  1  1  ' 
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of  (I  -  —  A'A)  is  for  one  .  to  be  equal  to  1  and  the  others  to  be  0.  In 

12  t 

that  case  —  A'A  =  lv. v!  where  v,  ,=  1/r  for  all  k.  That  is,  T  a,  .a,  ,  .  =1 

r  -i-i  ik  “  xj  i'j 

for  all  i  and  i'  .  In  other  words,  each  vector,  vith  elements  a,  is 
identical  to  all  other  a ^ ,  vectors,  or  to  their  negative  value.  Symbolically, 
A  =  (a,  -a,  -a,  a,  •••  ,  a) .  If  this  were  the  case,  one  linear  function 
of  p  /s  and  i\i  would  be  hopelessly  confounded  with  one  linear  function  of 
treatments,  ♦■hus  accounting  for  the  singularity. 

If  the  covariate  is  uncontrollable,  such  an  occurrence  is  most  im¬ 
probable,  and  if  the  covariate  is  controllable  the  situation  will  be 
avoided. 

One  desirable  case  of  singularity  should  be  noted  here.  If  A'A  =  J 
then  ^  will  be  confounded  with  treatments  leaving  the  3^'s  estimable.  Such 
mathematical  neatness  is  only  possible  if  the  covariate  is  controllable, 
and  only  desirable  if  there  is  no  interest  in  examining  treatments  or  mean 
regression. 

Under  the  above  very  rigid  restrictions  the  A  matrix  could  be  written 
as  [diag(a.)]J  or  A  =  (a,  e,  a,  a,  •••  ,  a)  where  the  elements  of  the 

1  IT  “  “  "  *■ 

vector  a  are  a_^  .  Then  equation  (3)  of  Chapter  II  becomes 


I  a.y . 1  I  a,  \  y.  . 

r  i  il  .  i  .  in 

X  j.  'J 

I  a,v,,  )  a.  }  y .  . 

~  i  i2  r  i  .  ii 


1 

Y 


Y  a,y.  Y  *•  I  y 

r  i  lr  r  i  “  • ; 


L  i 


i  3 


=  u)2ll  -  ij)(B  +  i>l) 


where 


R  ■  wl 
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Then 


r-1 


I  -  —  A'A  -  -^-(1~p)  [-  11']+  l  V  V’ 

r  r  *r  -  -  .  ,  r  -i-i 


i=l 


and 


1  i  i  r-1 

(I  -  -  A'A)"1  =  7- .f.- — r[-ll']+  V  — ~T  v.  v.' 
r  (r-l)(l-p)  r  —  ^  r-(l-p)  -i-i 


The  full  matrix  (I  -  —  J)  (I  -  —  *»'A)  ^(1  -  —  J) then  becomes 

r  r  r 


r-1 


i-1 


v.  v! 


r-(l-p)  -i-i 


The  matrix  has  r-1  identical  eigenroots  of  magnitude 

r-l 

These  roots  are  between  1  and  resulting  in  a  trace  between  r-1  and 

(r-1)2 

r-2 

That  is,  even  a  p  value  very  near  1  does  not  destroy  the  balance  of 
the  covariance  matrix  although  it  makes  the  I  -  ~  A'A  matrix  very  nearly 
singular. 

However,  this  property  is  lost  if  the  A'A  matrix  is  of  a  form  such  as 


Then 
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The  roots  of  this  matrix  are  not  balanced  and  the  trace  of  this  matrix  is 


(r-l)r 

r-(l-p) 


In  this  form,  it  is  apparent  that  the  trace  of  the  covariance  matrix  can 
be  made  as  large  as  anyone  chooses  by  making  p  very  close  to  1  .  If  P  is 
very  close  to  1,  the  primary  eigenvector  of  the  A' A  matrix  would  be 
-(-1111)  . 

These  two  illustrations  of  near  singularity  are  presented  to  aid  the 
reader  in  understanding  the  types  of  A1 A  matrices  which  will  yield  un¬ 
balanced  eigenroots  and  large  trace  values  for  the  covariance  matrices  of 
block  regression  effects.  It  is  these  conditions  which  will  cause  the 
tests  for  block  effects  to  give  poor  results.  By  way  of  summary,  these 
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(I  -  ~  J)  (Q)  (I  -  7  J)  +  7  J 


r-1 

+  J,  (1  * 

1=1 


and 


f!  -  7  J)Q(I  -  7  j) 


r-l 

1  (l  +  n.)y.y!  and  l  +  n .  >  l 
i=l 


Hence,  (I  -  —  J)  (I  -  —  A' A)  ^(1  -  —  J)  has  r-l  non-zero  roots  all 
r  r  r 

greater  than  1  .  And  if  R  ■  wl,  then  (I  -  7  J)M(I  -  7  J)  has  r-l  roots 

all  greater  than  ~  . 

10  2 

With  this  lower  bound  on  the  roots  and  the  upper  bound  furnished  by 
the  sum  of  the  roots  being  equal  to  the  trace  of  the  matrix,  we  can  often 
be  assured  of  a  high  degree  of  balance  among  the  roots  without  actually 
finding  them. 


APPENDIX  II 


CHARY S  AND  GRAPHS 
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FIGURE  VII 
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